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Foreword 


The present volume brings into the M I T Studies in Comparative 
Politics explicit consideration of an important range of issues which 
concern policy scientists everywhere Tliat the volume is the product of 
an International Symposium involving the participation of social sci- 
entists from many different disciplines and many different lands en 
hances its contemporary interest and. we believe, its lasting importance 
Among the many merits of the volume js the fresh focus it brings 
to the analysis of trends Disciplined knowledge of the past is basic 
to the analysis of conditions under which historical turning points have 
occurred and to the projection of alternatives under which future 
turning points consonant with our preferred values may be helped to 
occur We especially welcome, therefore, the studies of trends presented 
as “historical dimensions of ecological analysis" (Part Five) 

The analysis of conditions under which trends occur— the "manifold 
of events,” in Whitehead’s phrase, which produces what we are calling 
a “turning point” in any particular time space configuration— has con- 
cerned social scientists over many years Indeed, there arc some who 
consider this the proper business of social science qua science In (his 
context we welcome (he businesslike studies gathered under “multi- 
variate analysis m political ecology” (Part Three) as well as “factor 
analysis and ecological typologies” (Part Four) 

Consideration of typologies usually leads social scientists to rethink 
the fundamental structure of empirical social research The participants 
m this co-operative enterprise have chosen, again w isely, to make their 
rethinking of these issues explicit and public The chapters subsumed 
under “the logic of ecological inference” (Part One) are a valuable 
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contnbution to the continuous rethinking of fundamentals, which is 
indispensable to the scientific enterprise Equally important for any 
student of the effort to shape the “behavioral sciences” over the past 
two decades so that they can deal with political and other “systems” 
IS the group of studies concerned with “individual behavior and collec- 
ti\e properties” (Part Two) 

Among the high hopes of social scientists in recent years is the 
creation of a ‘ world observatory ’ This requires social research to 
become “cumulative’ m order that statements about different time- 
space configurations of individual and social behavior may become 
genuinely “comparative ” The construction of “operational indices,” 
upon which these high hopes depend, can proceed only in the measure 
that we build data banks in such a way that observations made in 
different places at different times can be ordered as “functional equiva- 
lents” and thus can be collated and compared The chapters concerned 
with "organization of ecological data archives ' (Part Six) provide a 
reasoned response to these high hopes and the associated demands that 
social scientists have pul upon themselves and each other 
We welcome, m short, a volume of studies that respond to the real 
needs of “comparative politics” m our time— a volume that, in so doing, 
may help to build the developing science of democracy throughout the 
world 


May 1968 


Harold D Lasswell 
Daniel Lerner 



Preface 


Tuts VOLUME IS an outgrowth of a long history of exchange and co- 
operation among European and American social scientists The core of 
this movement was made up of electoral analysts social psychologists, 
sociologists, and political scientists eager to apply the latest techniques 
of data collection and statistical analysis in the study of modes and 
trends of mass politics in the advanced democracies of the West This 
group never reached the stage of formal organization but mamiaincd 
frequent and fruitful contacts through a circuit of overlapping member- 
ships in such bodies as the Research Committee on Polilieal Sociology 
of the Inicrnaiionai Sociological Assoaation and the Standing Com- 
mittee of the International Social Science Council, various joint 
publications m the field of comparative political analysis testify to the 
strength of these ties 

The Europeans in this core group were torn between two styles of 
political research they were attracted by the great potentialities oflhc 
sample sur^e^ developed to such a high level of precision by Ihcir Amer- 
ican colleagues but at the same time felt heavily committed to the older 
European tradition of geogrop/wa/'Ceo/ogifo/aeo/u/j expressed in such 
admirable works as Andre Sicgfned $ Tableau Pohuque de la bronet dr 
VOucsi (1913) and Herbert Tmgvten’s Political BehoMour (1937J The 
sample survey appealed to them because of the greater wealth of 
variables and the directness of the analysis The Siegfried-Tingvten 
style of analysis limited itself to a smaller range of variables but allowed 
detailed exploration of variations in geographical and historical depth; 
through the collection, unit by unit, of data from olTicial statistical 
sources it was possible not only to identify local deviations from 
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nauonal and leg.onal averages go furlhcr back rnt.me than 

any sample survey would allow back to the earlier phases of democrati- 
zation and parly formation , , ^ 

The balance bclvtcen these two styles of research was deeply afTected 
by changes m the technology of analysis The sample survey came m 
with the sorter counter and the tabulator This equipment fitted the 
requirements of straightforward survey aoalvsis very well but did not 
prove very helpful in work with data for ecological units With the 
arrival of the electronic computer all this changed the greater capacity 
of these machines obviously had its cfTccls on the level of sophistication 
of survey analysis, but the impact on ecological work was much more 
profound The earlier attempts at ecological differentiation had been 
excessively simplistic pcTCcniagc and decile distributions, crude lypol 
ogles, calculations of differences on dependent variables The electronic 
computer opened up a vast range of further analysis options for ecolo 
gists correlation matrices, regression and covariance analysis, factor 
analysis, causal path analysis, calculations of systems of simultaneous 
equations in the econometric style The decisive change in the balance 
between the two traditions of research came between 1960 and 1965 
even the most ardent believers m the survey techniques found themselves 
attracted by the possibilities of longitudinal analysis opened up through 
the establishment of large scale files of aggregate data for computer 
analysis It is deeply significant that it was that great Mecca of the 
American survey tradition the Center at Ann Arbor, that took the 
miiialivc which led to the establishment of a computer archive for 
historical ecological data for the United States 

One consequence of this change in the technological conditions of 
research was a realignment of scientific alliances During the era domi- 
nated by the survey technique, the analysts of elections tended to main- 
tain their closest links with social psychologists and students of social 
stratification and mobility W iih the arrival of the data archive for aggre- 
gate statistics a new set of alignments began to take form on one side, 
the analysts of elections found themselves Jess and less attracted by the 
cartographic concreteness of the school of electoral geography, on the 
other, they bwamc increasingly aware of their intellectual affinities with 
thci/f/wneco/o^M/jand the econometricians ofthcregm^To/jc/c/icemove- 

ment As soon as the analysts of electoral behavior moved into the 
compuicriration of their data for local units, they discovered that a 
n^l^r of their analytical problems had already been thoroughly dis- 
sected by s^iologists and economists working on spatial variations for 
purposes of urban and regional planning Their dependent variables 
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dtfTercd, but the>r analytaal procedures had a number of features m 
common 

Tile present volume represents a step m this spelling out of new 
convergencies m the study of terntonal and spatial variations In our 
joint Introduction, wc have tried to sketch some of the background of 
these trends The chapters in Part One focus on the crucial issue in (he 
debate between the sample surtey enthusiasts and the afiaonados of 
the ecological method the h\elof-<mahm feUaaes The chapters m 
Part Two concentrate on the concrete possibilities of level to-leve! 
linkage through contextual analysis and the development of joint 
“macro micro" designs Parts Three, Four, and Five offer illustrations 
of the wide range of analysis options in ecological research the various 
techniques of multivariate regression, covariance, and linear causal 
analysis now used in political research, the factor amlytica! techniques 
used by urban ecologists and geographers, and the newer methods of 
time senes analysis currenily explored in historical studies of demo- 
graphic. economic, and political change The final section of the volume 
concentrates on the concrete and practical tasks of ecological data 
arclming this shows how the arrival of the “third generation'* com 
puters opened up extraordinary possibilities of expansion, both back 
into history and across a variety of distinct files, and how work has 
moved ahead in country after country, not only in the United States 
but in several European countries and m Latin America 
This volume could not have been assembled without the generous 
help of several organizations, research centers, and individual scholars 
Drafts of about half of the chapters were originally presented at a 
symposium organized by the rnlernaiional Social Science Council with 
the assistance of UNESCO, the Ecolc Pratique des Hautes Etudes, and 
the Direction de la Cooperation of the French Ministere de I'Education 
Nationale We arc greatly indebted to these organizations for Ihcir 
generous support and are particularly grateful to Kazimierz Szczerba- 
Likiernik and Clemens Heifer of (he International Social Science Coun- 
cil and to Sarny Friedman of UNESCO for their help and their patience 
We wish to rccon^ our cAattis' i<t Ctfttxf 

National de la Recherche Scientifique in Pans for his contribution to 
the organization of the symposium 
Finally we are both happy to acknowledge our fasting indebtedness to 
the research institutions where this volume was planned and or^nizcd 
(he Centre National de la Recherche Scienlifiquc and the Centre d'Etudes 
Sociologiques m Pans, the Cbr Michclsen Institute m Bergen, and the 

Center for Advanced Study in the Bchaviond Sciences at Stanford We 
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do not hold any of these institutions or indi»ji ^ ► ->■ 
OT omissions in this volume, but we do wunt t' -"is 
volume v/ould never have seen the light of C4> -> - 
support. 


'iifetc for errors 
't rl clear that this 
' their generous 
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’dattej Dogan 
S’ein P.okkan 
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The need for concerted effort to ad%ancc such cross-national 
ecology had alread> been spelled out in some detail at the large scale 
International Conference held at Yale University in 1963, the discussion 
of the archiving and analysis of aggregate tsationa! statistics naturally 
led to an examination of the possibilities of systematic studies of 
Kit/iw naiion variations not just at the level of the individual or the 
household but quite particularly at the level of the consfiiuent ter- 
ritories and communities * 

Statistical studies of local and regional variations have been part of 
the stock m trade of the social sciences for more than a century, but 
the vast majority of these studies were limited to the single countries, 
only a scattering of attempts had been made at quantitative comparisons 
across several nations of the directions and ranges of such variations 
By the late 1950 s however, the accelerating demand for research on the 
conditions for economic growth and social and cultural modernization 
had added new dimensions to such studies of within nation variations 
and prompted a variety of methodological innovations The great 
technical no^etyor.he I930sand 1940s the systematic sample suncs. 
had prosed of immense salue m the study of sociocultural sanations, 
but the samples generally allowed only fess and sery crude breaWossns 
Th' St~'«t resource for systematic 
"" ^ffnitously accumulat 
OKr^onT '"“"’"“"ons and booUcep.ng 

daTm^ts "> “'I'' '"'o •'’“o 

data masses and to lest nesv methods of analysis The result ssas a 
rtmSandrT""'"'' of '=nat,ons dunng 

vaf,e.rorlr/p<Lr!i‘'H'’^‘' ''T from a ss.de 

economists anthropoloEi!tr*,”'*i'"’"‘’""'^'"^’°"“"* geographers, 

"« .sr. rs ,ts 

Roklan andSafcn, 

S hy At.,d. aad R„h™, 

Ite aOKle by R.CSCO ’•'“"'o'' I966) and 

pr?Lt%’‘pTa 
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vamtions hisloriiins and geographers gasc ptiorit> to cartosraphic 
methods, anlhropologisls collected mformalion about the "lit" 
between the natural habitat the state of technology, and the orgamta. 
tion of society, economists dcselopcd techniques for the studj of 
vanattons in physical and tabor force resources, m distance to markets, 
m the localiaatton of mdustrtes, sociologists studied dimensions of 
dilTeremiation among runl. suburban, and orbtti communities and 
among sectors and zones of ihe metropofiian areas, poljljcil scjcnusis 
focused on terntonal vanatcons in the sire and compfcxitv of admini- 
strative structures in the entrenchment and activity of parties and 
interest organizations m the behavior of the ciiizcnr> at Ihe po))s in 
referenda or in strikes and demonstrations There was verj little 
co-operation across these divergent intellectual disciplines, and 
the debates among the practiiioners of these differing approaches 
often tended to generate confusion rather than promote clanfica- 
tion 


LEVELS AND DIRECTIONS OF "ECOLOGICAL" RESEARCH 
Some of the confusion in these debates across the disetpirrtes is 
reflected in the multiple usages of the term "ecofogj ’’ 

The original usage is quite clear Ernst Ifaeckel introduced the 
neologism as a term for (he study of the influence of the physical and 

biological environment on thebchavioranddevelopmenl of organisms* 
Haeckel had confined his research to plants, but essentially the same 
perspective was soon to be apphed in the stud) of animals 

A distinctive school of ffutnan ecolog) first developed within 
the sociology of urban commuJiiUcs.'' but similar approaches to 


** Ernst Haeckel t«W7) further d»cu«.on 

of Ihe hisiorical contevi see R C S»u/Trr ’ Haccket Dan^.n. mJ EcoJesy, 

of human wmmumuei 

and behaviors were the Chusgo jporlo^isis Robe” C. Park and t mevi W ^rFe«. 

1950) For 3 broader «Ia .on OMe«s^ (Cxamton Row. I9'S.I» A ovrful 
Theodorsen I ^ ^ hff»frnrl3nJ amniat.BndhumineCT>IocyrtO D Ouncan. 

discussion of the linkage* bei««»^n 



4 


MATTEI DOGAS ASD STTIS POkkAS 


the Study of environmental mllucncc on human insiitulions and 
human behavior soon developed within cultural and social 
anthropology “ 

The term “ecology” soon won general acceptance in the social 
sciences although usage varied from discipline to discipline In its 
broadest sense ‘ ecology” was used to cover all varieties of research 
on the adjustment of human beings to their environments A recent 
reader m human ecology has chapters on the climate and the soil as 
well as on the sociocultural community contexts of behavior, it 
even includes a discussion of man’s possibilities of adjustment in 
entirely new environments, such as the depths of the ocean and outer 
space’ In a narrower sense “ecology” has tended to be used of a 
wide variety of approaches to the study of spatial and tcrntonal en- 
vironments of human activity, whether defined in smaller surface 


units, such as tracts of agricultural land, villages, neighborhoods, 
blocks or precincts, or in larger administrative units, such as 
communes, cantons, counties, constituencies, or even entire national 
polities 

Even in the narrower sense the term "ecology” has been apnhed to 
a w,dc vancty of approaches to the study of the spatial territorial 
environments of human behavior These divergencies can be mapped 
divert ‘liaiansions the levels of variation distinguished, the 

elTons. the availabiliij of data level by 
suid’es f handling the data All ecological 

tL i^mZr ."'"’"1,°" ''“"“hons among units at some level atove 
of deSen 1 n, 'h'"- ahoiees 

dam^::dhng"™“" ■" and 

(on“varrabl«"]ueh ” ’'^"'“'on variations 

affiliation, parly prefereneef and v, ^ "'“““on, religious 

units (villaces rural n ' ^ variations among proximal territorial 

(villages, rural communes, and precincts or wards within urban 

(o^LlmTat“thfElven^|e,eh''or'r' rf'n 
teristies of the subordinate or 

•' or the superordinatc units) This 


Adduon VVolq., 1965, d Colletoon 0/ Rradmx, (Cambridge 
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g,ves the Tour poss.h, llt.es shown tn the ..ecompanj.ng table' 
r> n Demcd Data 


Indinditals 


Terntona! 

units 


Personal atirtbules (sex, 
age, etc ) or behavior 
characteristics (church 
attendance, voting) 

Conie'slual data member- 
ship of. exposure to. 
territorial unit of given 
global or aggregate 
atlnbutc 

Global attributes 

characterizing unit as a 
whole, not derivable 
from data on individuals 
in the unit 

— 

Aggregate data unit 
characteristics derived 
from distribution of m 
dividual attributes or 
behaviors 


k f the four ecus — 

In practice, data m one ° political adminislrali'e 

at hand they may ‘ ,o any ■•direct" methods of 

reasons, and it may be too late to resort 

data collections ,a„ge of areal units but no 

There may be data for tndtitdua's a^ oommun, y 

way of Identifying the ./ftoquenlly Ihe case m 

contexts of their behavior This is fr^^ of secr^, or 

analyses of nationwide sample 5“ > oo longer be any 

econW or sloppy adm™' ^^jUnls -o any ^■'0''" 

possibility of allocating individual 

primary sampling units frequent no ,„nal 

The contrary situation is established for lerritona 

at hand, but nggrejnle d's^but ons h« ^ „r official 

units at different levels This is the case 

pi **'“ * , II 

.• For early slalemenls of Ibc "rJMChlSVr.; 

,„P F ‘a.mrsfeld and Mpn;^_^_,^,„,,„.„rol.u^^^^ 

For further devclopmcnl s« J A^^^ ^ ^,9^,) 

sition,” nrt,pnCt*s # pmiO „/■ Sucw' 

“Prornetfis >''^"“’^'^"'■‘5 II and V ofhis t ^ UrfAV* / 

(1963) PP 275-299. and Chapt^ " Q,„u„g T^ry 

(Pans Plan, '’fj, 'pSi 
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Statistics the primary individual riata have cither been Upt ^ 

uL as m elections or referenda, or cannot be made available for 

lt::r‘;t«ofeeo'no.n,c reasons, as vvill often bcthceasefor census 

Hf.fa school erades, tax records, criminal statistics 
What are the typical strategies m Ihe face of such 
access’ There arc essentially four options On the one hand, a social 
scientist may choose to focus his study on the explanation of variations 
at the level of the wdnidual or he ma) aim at the explanation ol 
\analions at the le\cl of the ternional unit On the other hand, he may 
vrish to limit his research to one level at a nme or he ma> be more 
ambitious and consider the processes of interaction beli^een the rwo 
le\eh These options can be presented as shown in the accompanying 
table 


Lenelof rocus OF As*i-vsis 

Depe>dest 

Variable One Leiel Interaction of T*^o Letels 


1 I Biher individual 

1 data (e g., from 

1 surveys) treated with* 

1 out reference to 

territonai contexts 

Or temtonafaggregate 
data used to analyze 
individual variations 

III Either, individual 

data used jointly with 
contextual data for 
territorial units 

Or aggrcgate/global 
data used to lest tnier- 
aciion between levels 

11 Aggregaic/global data 
for lemtonal units 
used to desenbe and 
account for variations 
at tcmtonal level 

IV, Ei/Aer: joint use of 
individual/aggrcgatc/ 
global data to test 
sources of change in 
temtorial structure 

Or aggregaie/global 
data used to test inter- 
action between levels 


poliiio? *^'i»'Portancc of secrecy in the organtzation of 


^ 21 fm-i, “ of Personal Privacy, The 


Jf,. “The Idea c. « ... 

American Stamucan 21 (19671 ^^5? ^ Personal Privacy,** The 

synemz (data about mdividualV « nrf,wd.iiT" * b««ten wtelhgence 
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establish equations fo cnoiahst) 
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sample surveys m a stratified sample of ecological contexts varying 
along the dimensions you want to study) and proceed to analyze 
individual behaviors as a function of their spatial-territonal context,*^ 
or you continue to stick to data at the aggrcgate/global level but 
develop statistical models for the exploration of alternative linkages 
beuveen individual distributions and higher unit characteristics You 
may, for example, have established a correlation between aggregate 
percentages of rented apartments and Socialist votes, but you do not 
feel confident that you can infer from this correlation any statement 
about the distribution of individual votes by type of dwelling You may 
be able to specify upper and lower limits to your estimates for each cell 
in the “black box,” but to increase the precision of your analysis, cither 
you will have to invest in some direct field work, or you will have to go 
further in the specification of your model and bring in more aggregate/ 
global variables (percentages of municipally versus privately controlled 
apartment houses, the party or parties in control of municipal govern- 
ment, the degree of class homogeneity within the territory, etc ) to test 
the alternative expectations against the observed aggregate distributions 


T}pe IV 

You may finally opt for a “micro-macro” design your dependent 
variables are the characteristics of the tcrnional social systems, but you 
seek your explanatory variables not only at the global/aggregate level 
(such as geographical configurations, concentration factors, distance 
factors) but at the level of mdmdua! decawns There is an interesting 
para e cre between the work of political sociologists and urban 
ecologists fn the field of electoral ecology there is a clear need for a 

levels The 

^havtor of he etl.aens cannot be understood without tnowledge of the 
a„d?r,'™ •he"' by Part.es, electoral arrangements, 

Zt j.' , alternaltves (such as the ma.nten- 

bravmrs o? are, tn turn, dependent on the 

cfam ensl r" a'' the •‘macro” 

ahernatne Ira “ variety of eonstramts on and 

age, quahty, s.re, and cost of the dwell, ngs and the soe.al charaetenst.es 
Sch^h in Part^oVr*^*L ’b* rhapicrs by Juan Linz and Erwin 

« (TO3), pp" nlln'"'"®'' 
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of variables considered and restriclcdthcm/w&rr of variables taken into 
account in any one analysis Variables tended to be judged by their 
immediate graphical covariation with the dependent variable rather than 
by their share in the total variance there is no obvious cartographic 
equivalent of the partial correlation The reduction of territories to 
points m a scattergram or units in a statistical calculation clearly cuts 
the analyst off from a variety of rich spatial cultural cues to an under- 
standing of the processes at work but at the same time forces him to 
clarify the logic of his reasoning and specify the evidence for his 
conclusions 

For a variety of reasons the devotion to social, cultural, and political 
cartography was particularly stubborn in France The German and the 
American attempts to develop a ‘ statistical geography” did not find 
much response there until well into the fifties This was not a methodo- 
logically unified movement Several parallel efforts were pursued m 
isolation from each other there were attempts to refine the early work 
by Durkheim on the ecology of deviant behavior and mental disease,*® 
there was a continuous movement toward greater statistical refinement 
m the study of urban development and urban structure,*’ there were 
signs of increasing interest in the construction of multivariate typologies 
of areal units, whether cities or rural units,*’ and there was a great 
variety of efforts to build the foundations for a discipline of political 
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II 


Bv science*' 

'"“’''"S emphasis Vias also given to He temporal 
dimensions of such research The great ware of studies of developmental 
proccssespwheihcf economic cuUuraJ or political^ had tempted a 
number of scholars lo try out new approaches to the analysis of local 
variations geographers such as Torsten Hagerstrand economists 
such as Walter Isard “ and political scienusls such as Karl Deuisch/* 
had all m their various styles suggested ways of formulating models for 
me spread of items of ‘modernity (production facilities consumption 
habits, vocabularies slvijls occupations styles and norms of Imng) 
from central to peripheral areas and proposed techniques for the 
statistical study of such processes of change in space and in time By 
the 1960 s these striking convergencies of orientation across tradition 
ally very distinct disciplines had created much ferment m the adtanced 
countries and tempted an increasing number of scholars to explore 
new models and new techniques in the study of time space sanations at 
different levels of the society and the polity 


THE EVIAN SYMPOSIUM 
OBJECTIVES iSSUES PERSPECTIVES 

It was against this background of long standing mtelieclual concerns 
and exciting theoretical and methodological innovations that the 
International Social Science Council took steps to prepare a first 
Symposium on Quantitative Ecological Analysis in 1966 


^‘KennCox op ett djsnjjsfslhedilftrcncfsbclvfeenireogfaphicatandpofiitco- 
sociological approaches to the siudy of ccmlonal varraiiorts and reserves for 
eiccforal geography the lask of identifying desenbing and analyzing the dis 
Imciive regional ossaaolions of yartabfes at the sobamitrse teset , in practice Ihfs 
amounts to saying that the geographers laslc begins when the standard analyses of 
sociologists or political scienDsis have identified the areas that aeviare mow con 
ehtenity from the mean expectation wiihiit Ihe given temtomJ univerw This is 
hardlyastabledemarcationofihedisciphne iheboundatyisafunciionofthenumber 
and sophisncalion of the variables brooghc into the m«ll.p/c regrnjion equ^ions 
« T Hagerslrand Innoranansfortamf kordhgisU sympunki (Lund Glcerup 
1951) The Feopoganon of Innovation Wares (Lund Lund Stud « In Geography Ser 
B No 4 1962) A Monte Carlo Approach lo DifTiKion turopi'ennei 

(eds t Education and Economic Deiflopmeni (Chicago Atdme 1965) 

“W Isard"/”; Methods of Regwtal Anotym An Inlrodicnonlo Regionot 

Science York MSriation and Pohlical IJeveJopment. Amencan 

foliucal Science ReMeiM^SSa^bX) PP 49J-514 



12 


MATTEI DOGAN AND STUN ROKkAN 


The objectives of this initial international gathering ucrc formulated 
as follo^^s 

1 To provide an opportunity to discuss in some detail experiences in 
the planning organization and operation of ecological clalaarchncs And 
to examine plans for joint international action to ensure accelerated 
development of such facilities 

2 To ensure effective confrontations of diffcTingsuhsianine emphases 
and methodological traditions in the use of qiiantilaine data for localities 
and to examine the pros and cons of alternative designs of analysis 

3 To review the possibilities of joint strategies at se\eral leiels of 
aggregation, particularly the possibilities of combining sample sun cys of 
variations at the level of the individual with ecological analyses of the 
proximal contexts of such variations 

4 To examine recent experiences in the deselopmeni of ecological 
analyses m historical depth, particularly through the organization of 
data files for local and regional variations before the decisive break- 
through to economic growth and during the subsequent processes of 
urbanization, mobilization, and modernization 

On each of these points a great variety of efforts have been registered 
at the national level, but very little has as yet been done to ensure 
effective international co operation 

The international Social Science Council has taken a first step 
toward the organization of regular interchanges among archnes of 
ecological data “ It has established a Standing Committee on Social 
Science Data Archives and has set up under this committee a “task 
force to look into the possibilities of joint action on technical and 
methodological problems in the accumulation, storage, and retrieval of 
data for localities 


nrL or d.soissions or Ihe tog.c of aggregate aoal)S,s was 

f “"forence m 1963, the comparison or aggregate 

samf " T scientists with essentially the 

Auh F “Mtegates ror regions or com 

t the Evian symposium these issues were at the heart or the 

“ £' Rair "" ^‘•"<■1 ^rleocrs op c, 

Saenee Information 6(1967) pp ^nd Prospects * Social 

Rokkan (rfs I'^Conirc'iw Awioti th'’ r^™i.“' '■’'O'"' 

Allcer a leader in the Yale efTorti of further reflections by Hayw-ard 

««) ct Alker, c'h%'’'“' 
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debate, there was abundant evidence that the earlj aiiacXs on the 
ecological fallacy’ ” had given way to serious cfToris to spell out the 
assumptionsand the hmtfalions of inferences from one level of descrip- 
tion to another and to develop better tools for the handling of aggregate 
data ” The field is m a great ferment and there is a dcfinite'’necd to 
develop better channels of commumcaimti among all social scientists 
currently engaged in the analysis of ecological data masses 
The Evian symposium also offered a first opportunity for detailed 
discussions concerning analysis techniques and research strategies 
The experts gathered for the symposium unanimously agreed to rccom 
mend that such exchanges of techniques be followed by the organization 
of mtcrnalioml work teams to test alternative procedures on several 
bodies of national data It was strongly recommended that ‘data con 
frontation seminars be organized for ecological data along the lines 
currently tned for survey data experts on ecological analysis would 
bring their data with them to an analysis center and work out joint 
analysis programs through detailed discussion of data chvraetenstics 
techniques of data handling and models of explanation A plan for 
seminars of this type has since been presented to UNESCO and the 
first in the senes is likely to lake place in 1969 
The 1 nternational Social Science Council has also given consideration 
to the planning of comparative research on heal ume h^s in processes of 
fiisiorical change and has been concerned with enlisting the co-operation 
of historians in such research At the Evian symposium one session was 
focused on problems and potentialities in the development of compara- 
tive historical ecology This is a field where the French schools of 
historical demography and regional bislory have made particularly 
important contributions 

At the Evian symposium the principal concern was with local 
variations in economic social and cultural conditions before the 
decisive thrust or urbanization and mdustnalization m the nineteenth 

» The Inma! siaiemeni of the fallacy of wfcrc^ from ccolog.^1 to i^.v|di«l 
correlations IS due loW S Robinson Ecotopral C^la'iom andlhe^tav^^^^ 
Individuals. American Sociohgitol Koien tJ (1950) pp 55! 557 For furlbcr 
toonL byA)to Altod, 

Variance Componenis Model of Pol tirai^ci* ' 

Folmal Arnold >«» W 
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century Peter Laslett set out the Work of the Cambridge Group 
currently endeavoring to establish benchmark data for the study of 
changes in demography and social structure in England during the 
seventeenth and the eighteenth centuries, this represented an attempt 
to apply to available English records of the techniques of family 
reconstruction from parish registers developed with such success by the 
French school directed by Louis Henry The French historian 
Emmanuel LeRoy Ladurie, himself a leading analyst of seventeenth- and 
eighteenth century developments showed how the data on regional and 
local variations m the early phases of nation building could be linked 
to corresponding information on the acceleration of these processes of 
change in the nineteenth century He did this with the aid of a large 
collection of maps for regional variations in France, some of them 
based on records and estimates for the anacn ri^wie others taken from 
an extraordinarily interesting volume from 1836, d Angcvillc’s Essai sur 
la slatisJi<iue tie la populaltott francatse ** 


The International Social Science Council hopes to link up these 
French cfToris with similar projects in other countries The recent 
scholarly discussions of dimensions of political development have led to 
increased interest in the possibilities of establishing local variations in 
rales of change during the nmclecnlh and the twentieth centuries In 
th«e further efforts it will be essential to jink up the work of “pre takc- 
ofT conditions with analyses on the variations in the rales of change 
during the decisive periods of cultural social, and political mobilization 
To cite a recent example the great strength of Charles Tilly’s study of 
the ecology orihc Vendee during the French Revolution is that it ties in 

to. Andre Siegfned's 

yrarTlato””^ of W«t of France a hundred 


with already been discussed 

y ar or tfo On"? * r*^ d-n® ‘ho 

year or two One line of actmtv fomo^e: r... »u . ° ^ 

work of h,:, or, cal do, a arclorl e ‘ 

already been taken by the DATUM ‘“'‘00“°" has 

Will ii hoTVft tv. 1 L- j ^ ®*^Sanization at BadGodesbergand 
will It IS hoped, be linked op »,th similar efforts ,n the United States 


See Chapter 19 of this volume 
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Quantitative Ecological Analvsis m ihe Sooa/eJ^ Paper for the Symposium « 
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under {he aegis of (he American Historii^I Association and the Inter* 
University Consorlmm The standing commitlee of the Internauonal 
Social Science Council wjll have an important roh to play m these 
developments Perhaps the most interesting proposal to come out of 
these inlcrnattonal discussions ts the idea of ohm confroniation seminars 
for historical ecological data, these go beyond the traditional discussions 
of already “frozen” naUonal analysis toward joint computer processing 
of (he raw data brought together by (he participants from each country 
This innovation m the organization of international research co opera- 
tion will be tried out for post-World War tl data by the Intcr-University 
Consortium at Ann Arbor m 1969 Once the technical difficulties of 
such joint processing operations have been solved, it is expected that 
this style of co-operation will assume increasing importance m com- 
parative research on national developments 

Closely linked with these cfToris of historical data archiving are the 
plans for the preparation of Ouuhs to Data for Comparatne Research 
The iniliat work undertaken in the field of electoral statistics’^ will, we 
hope, be followed up by parallel cfToris to account for variations m the 
avaiiahihty of histoncal statistical indicators by locality for a selection 
of countries A trend report and selected bibliography on ecological 
analysis are being prepared 

Finally, a third set of plans focuses on the systematic exploration of 
alternaine moJels of nation building and center penphcr} contrasts^* 

A first conference of historians and social scientists interested m (he 
comparative study of processes of change during the crucial periods of 
centralization, urbanization, mass education, and democratization from 
about 1750 to 1914 will, it is hoped, be organized some time during 
1970 and should, if the necessary finance can be found make it possible 
for the International Social Science Council to launch a longer term 
program of co operation between statistically oriented historians and 
dcvelopmentally oriented social scientists 


“See S RoVkan ‘The Comparative Study of Eiecioral Siaiisiics 
Science Wormaiion 5(19661 pp ^19 Thefirst volume of the /n/ernalWlTufii’ero 
IS scheduled to appear lit 1969 

Fora review of researchmihB field andaset of sogsestions about possible steps 

m Ihe organiaaijon of internatioital ciMipcration sec S Rokkan Models and 
methods m the Comparaiive Study of Nation Budding Acta sociahgica (lo press 
1969) 
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The Logic of 
Ecological Inference 


These four ehaptcrs eantau le oi r cntr<.e <rn mstifire th^ seek lo elari^ the 
maheJolo^teal eentcm aj the current ejerts tc muie fuller and more meaningful 
us« ^ {ht vast toJ}« of ooihhle stainucs h/ tctnioria} units 

kail Deutsch sets the general framework for the further discussion Hai mg made 
signal eantribuilons to <)uaMitati*e analj/sis at the levels and local 

ities as net! as at the levels of national polities and regionel political ^sterns, he is 
in a hecter position than anj other social scientist to map the tenitories for promising 
adranees anJ to plaev the eontnbutions of the ecologists in a brood iheotelieal 
perspecin c 

The other three chapters all focus direetijr on the ecological fjDac)’ the/ am 
shtute so many commentaries on the rloesicol article by \Y S ^hmson HaynarJ 
Alker gcneiaUres the Robinson argument and shows Aon the ecological fallay fts 
into a broader system of Iciel to level shoncuts m anal)tieal procedure The luo 
representatives of the great finnuA school of social ecology, Brik AHardt and 
Tapani lalkanen, take up the challenge <f /Ac Robmson article and tatse impor 
tone issuer of research siraicgj the old warntrtg oguinst mferente from ecological 
to individual coionations should not fiist be treated as onotAcr * ted light" in the 
coniro) if the research trffc Aw jIioiiW fre useiJ as a sptingboatd fit lf>c develop 
menc f new methods of two IcscI anaijsis TAc level by level t/iscrf/’ijncieJ in 
tfic i/frcftion and strength of correlations amsSttute an important field of research 
in its own right mJlildual bchanortdo not foit vary directly with ecological 
characteristics, they interact dynamical)/ with them Uhat Rnhinson did was 
simply to point to the probkm the n ti step is to develop o strategy fir rescorck 
on these processes of level with level interaction The 1950 warning neighed 
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heavily with the generation oj research norhers brought up in the tradition oj the 
test and the sample suT\cy it kept themjiom the rich bodj of ecological injbrmation 
produced by the governmental bookleeping machtnerlci. The current generation oJ 
social scientists has been freed from the pressures of this academic superego: to 
them Robinson docs not set a barrier hut poses a problem for serious research. 
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On Methodological Problems of 
Quantitative Research* 


I HAVE CE>^CROUSLV be«n alloHcd the task of illuminating the general 
methodological context of quantitative research one of those neat, 
sharply defined topics I propose to examine three major functions of 
quantitative methods and data m the social sciences, and will later 
proceed to a discussion of some of the implications for the next steps 
in our work 


THREE AfAJOR TASKS OF QVANTJTA TJVE METHODS 

The first of the major functions is the elaboration and specification 
of existing theories and existing sociological knowledge The second is 
the generation of heuristic suggestions about new phenomena, new 
relations, new questions, and new theories This involves the develop- 
ment of our powers of recognition and our tools of recognition for 
new phenomena Then, in the third place, v-e shall deal with the use of 
quantitative data and methods for deciding particular theoretical 
questions through processes of strong inference, and, insofar as this js 
successful, we shall consider the possibility of a substantial general 
acceieration of the growth of knowledge which is both nontrivial and 
at least partially verified From a general discussion of these three 
functions, I would then like to go on to their implication for some of 
the next practical steps which social scientists could take 

• Research uiiliicd m this paper wassuppMted in part by the Camegre Corporation 
and m part by the National Science FoundaUon and by Ihe National Institutes of 
HeaJlh through ihc Menial Health Research Insutute at the University of Michigan 
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Making Existing Theories Richer and More Precise 
Let me begin with a discussion of the first point the elaboration and 
specification of existing theories and existing knowledge Here, I must 
go back to what looks like a \cr> abstract philosophical question, 
namely, what, for our purposes is meant by kiwi^ledge At this point, 
1 base to take issue with the philosophy of Sir Karl Popper, who has, 
I fear, obscured this subject 

Exisieniia! Sialemeiifs and Disco\crtes As I sec it, knowledge involves 
first of all a collection of exisiential statements, that is, precisely the 
kind of statements Sir Karl Popper thinks are of only secondary 
importance ‘ An existential statement is a statement that has the logical 
form of “there IS ' for instance, “there is a continent west of the Atlantic 
Ocean ” Existential statements are the \cry essence of discovery 
Disco\ery usually leads to new existential statements “There is a 


planet Neptune and "X has the property Y" are logical forms of 
existential statements An existential statement is a discovery either of 
a thing or of a relation If you diSco\er a relation, the statement has 
the form of “if then.” such as “if amber is rubbed with dry cloth, 
then It will attract lint,” which is one of the existential statements that 
led to the disco\ery of electricity * 

In cither case, the existential statement can be verified not only by one 
operation that can be reproduced but by many different kinds of 
operations that are all reproducible As more and more kinds of these 
different but confirmatory operations are discovered, there comes a 
^ ^^‘cniists tend to accept the former construction as an 
f^ct AnM t* ^ ^nce a hypothesis is now accepted as a 

covered r?h ‘he existence of this fact will L dis- 

at incxhausi hi Af ' operations may be treated 

fac tua" of c^ul tnultiple confirmation as 

bUKk for new theories and as a stepping stone to new discoveries ^ 
An existential statement can nl*. fi.» r,, r <Jo-coveries 

kc the form of an invention We can 


I5S9) rr (London Hulchinson, 

Kctlion) on Kjtl Popper i toraenSS, ® "'""'m sec Paul Bornays ' Rc- 

•a Jameill Conani e Cl’ 77-M 

Pro>.19Jn pp 32-37 262 ’63 Vale Unircrsily 

p-trenlej al conreienceorAmenaVA^a LLyrical Real.lyn • paper 

L W tValwh •OnTheoC;^T“,!^^7“rAr.,andSe,cnee, Bor, on, 1955. 
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say not only “there is" but also “there can be ’’ A proposition of the 
form ‘ There can be a machine that flies, and tint is heavier than air, 
and tliough it has not yet been bmll, it could be built" was one of the 
crucial intellectual steps which led to the inscnlion of the airplane 

Not only can existential statements be disconfirmed m principle, but 
they can be confirmed positively Sir Karl Popper has made a great deal 
of the argument that co/isinicfSy the more abstract and not immediately 
verifiable concepts of scientific theory, arc susceptible to disconfirma 
tion but not to confirmation Indeed he has denied that they can ever 
be confirmed ^ Existential statements, ontheoihcr band are susceptible 
to positive confirmation We can build (he airplane We can see whether 
the continent of America exists Wc can repeat these operations, and 
so can others 

Even more extended conceptual schemes can be turned intoexistcnlia! 
statements Wc may have many scattered geographical data and orga- 
nize them into a conceptual scheme of the fiat and bounded earth 
although at the time when it was constructed no individual had ever 
traversed this flat earth to Us supposed limns Another theory or con- 
ceptual scheme was then developed picturing the earth as round This 
was done at a time when no one had circumnavigated the world At 
that lime, the globe was only a conceptual scheme, and if Sir Karl 
Popper had been alive in the thirteenth century, he would have argued 
that It could only be disconfirmed. but not confirmed because it was 
a theory Howeier, in the fullness of lime, the earth was circumnavi- 
gated by Magellan in three years and more recently by satellites in 
eighty minutes What had been a conceptual scheme became an exis 
tential statement 

One of the most important processes of science is exactly, therefore, 
the turning of conceptual schemes mto existential statements The 
physicist P W Bridgman has spent some lime in explaining how the 
conceptofthcelectronmthel890swasa construct" and presumably 
in Popper s sense, only susceptible to radical disconfirmation However, 
in time, as cumulative evidence built up. physicists came to the con- 
clusion that ‘ there was something there “They now treated the electron 
as a concept that bad proved fruitful and for which there was some 
direct evidence, and eventually on the strength of still more evidence, 
they attributed reality to it This was exactly the difference between 
the ether concept, which could never by turned into an existential 
statement, and the electron concept, which could be transformed m 


* Cf Popper op cil passim 
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that sense ‘ Poppefs vew - that there ts nothing m saence natural or 
social but theoretical constructs susceptible to nothing but discon 
firmation — Ignores this crucial contribution of existential statements 
and discoveries It is the philosophy of a philosopher who has never 
made an empincal discovery m his life and who. perhaps would not 
even like one if he saw it It is in this sense, a serious obstacle to the 
growth of scientific knowledge m the social sciences 

Knowledge, then, is first of all a collection of existential statements, 
of statements capable of positive confirmation But it also consists of 
theories We may define theory as an information code for the storage, 
retrieval, and processing of new items of information, and for the search 
for new items of information * 

Theories can be disconfirmcd in their predictive dimension, and this 
IS the element of truth m Poppers arguments But theories can also, 
m part, be confirmed positively as to their relative truth content in 
terms of the collection of verified existential statements that they encode 
These existential statements then, would also have to be included (at 
least most of them) in their successor theories 
In this sense, the succession of theories is not arbitrary or accidental 
We can predict of any theory that, when and if it is overthrown, all or 
most of the existential statements that it encoded will have to be included 
in Its successor theories if these successor theories are not to be con- 


sidered a backward step in knowledge. 

The Impact of Quan/ifafne Dafa Quantilalivc data and methods m 
any field of science add a great deal to the specificity and richness of 
the existential statements m our knowledge Therefore, they increase 
the demands and the tests of verification confronting our theories In 
this sense, the introduction, on a very broad front, of quantitative 
methods and of quantitative data is of decisive importance in making 
our social science theories more scienufy; 

Let me give a few examples of the specific use of such existential 
statements in the testing and development of theories One is the 
existential statement that in West German opinion polls for more than 
a decade, national issues as a primary topic of saliency or attention 
have outweighed issues of European integration, usually by a ratio of 
ten to one or fifteen to one That is, typically, 51 per cent of voters 
consider national issues as dc^rvmg primary attention, while only 3 


'’'■Idgman The Logic of Modem Phyjics (New York Macmillan. 1946) 
* Sec references in footnote 2. 
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per cent or 5 per cent put European integration first This is an existen- 
tial fact, and it tells us somelbmg about politics at a certain time and 
place Or here is an existential statement for a slightly longer period 
It IS possible to show, on the one hand, increases in certain quantifatne 
indicators of European mtegration all the way from 1913 to the early 
I950’s or to the mid 1950 s, as well as the rise and growth of a wide 
variety of European political institutions, but these indicators fail to 
show any further structural integration of Western Europe after 1939 
That IS, although we have a fairly stable increase from 19J3 to 1935- 
1959, we find that after 1959 Western European structural integration 
has halted * Another existential finding shows that French mass opinion 
began to diverge radically from the Western European consensus on 
such important issues as attitudes toward the Soviet Union as early as 
1955, that IS, preceding the accession to power of General de Gaulle 
It was not President dc Gaulle who produced the French attitudes, 
the change m French attitudes preceded General de Gaulle Similarly, 
there is substantial evidence of a relative withdrawal of the French 
economy from international integration The economies of Germany, 
Italy, and Britain today have roughly similar ratios of foreign trade to 
gross national product as they had in 1928 The French economy has a 
substantially smaller ratio — about a third less of it ’ France has be- 
come, m many ways, more self-sufficient m the long run This fact may 
be changing in the late I9d0s. and we can interpret this in different 
ways, but there is no denying the fact itself, once it has been discovered 
and put into significant context 

I have given a number of examples from some recent research ne 
have done on European integration There is the gap, let us say, m the 
mass opinion both in Germany and m France, between expression 
of mutual sympathy and indications of trust Usually we find about 
three Frenchmen expressing empathy or “good feelings" about Ger- 
many for one Frenchman willing lo express trust The same seems true 
of German mass opinion * Other facts beanng on the probability of 
lasting poliijcal integration among nations are the ten to-one pre- 
ponderance of domestic mail over foreign in most advanced countries, 
including small countries with reputations for internationalism, such 


‘ The indicator figures are based on reproducible facts The inferences of iheir 
siEnificance arc. of course, hypotheses based on our conceptual scheme of mt^rairon 
Cf K W Dcutsch L J Edinger R C Macndis and R L. Memtf flv/rce Cermany 
andihe H’esiern Alhancet.'tie'MYoik Sentmer 1967} 

■'fWd.pp 230-255 
• Ibid , passim 
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as Switzerland the Netherlands Britain or Trance m contrast with 
colonial countries where foreign mail predominates over domestic 
mail ® Other such facts are the declining rate of foreign trade both with 
time and with the size of the country (on which we have a great quantity 
of data) and the near symmetry of the indices of relative acceptance in 
transaction flow and their correlation with empire or past memories of 
empire 

Another existential statement of theoretical relevance is the large 
variability of indices of income inequality Some of the work of Hay- 
ward Alker and Bruce Russelt produced data which seem incompatible 
with the theories of Vilfrcdo Pareto namely, the view that income 
inequality was supposed to be independent of social change or political 
revolution Paretos view turned out to be false There is now a 
possibility of getting quantitative indicators of social mobility in dif 
ferent countries thanks to the work on matrix analysis by Harrison 
White, I Richard Savage and Leo Goodman “ Or it turns out that 
in a comparison among 53 countries in the 1950 s the frequency of 
political violence changes with the level of income political violence 
increases with income over low income ranges but stays roughly un- 
changed over an intermediate range and it decreases with income over 
high income ranges 

All these are examples of existential statements which ought to be 
accommodated, which put demands and pressures on the theoretical 
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structures that we hate bcrorc us Many other facts arc pressing on our 
theories The expansion ormiddle-classjobs Ihechangesjnthestructure 
between skilled and unskilled labor and between white collar and blue 
collar jobs — all these facts have been well known Ncverihcless their 
cumuhtnc pressure on social science theories has become great and 
our social science theories ha\e not been accommodating them 

Heurisuc Sut^^esuons for Wch Inquiries 
The specification of our theories leads almost immediately to heuristic 
suggestions Merc I would like to say something about heuristic sugges 
lions in regard to the search for the turning points and the limits of 
processes and relationships Classic Aristotelian thinking seeks the 
of rcgulanlies in social behavior and fends to point out the limits 
m order to urge the social scientist or philosopher to keep away from 
them because all excess is to be avoided Pragmatic thinking »ery 
Widespread in social science in the United States happily ignores such 
limits it works wtili the here and now and it does not often ruse the 
question Mow far can this thing go and at what point will it break 
down’’ Marxist thinking selects certain types of limits and then 
emphastres w here some dialectic change ought to be expected Usually, 
however Marxist thinking only points out that some particular change 
xsiHIiappen hut « docs not invcsiigaiem detail just how it works and 
other limits which arc not highlighted in classic Marxist theory are 
very often not studied or their study is delayed (It may be that we in 
the West arc not informed enough about the development of Marxist 
social science but my impression is tin! while most of the limits 
pointed out fifty years ago in the Marxist classics may have been 
studied only very few new limits or new qualilativc changes have 
attracted the attention of our colleagues in the countries that we some- 
times inaccurately call the Enst ) The need in all these matters is for 
more scientific thinking for more attention to the limits of relationships 
nnd to the points where things break down In (hat sense the work of 
the late V O Key Duncan MacRae Philip Converse and others on 
critical elections and basic realignments is of real interest 

In the future before us we may have a period of far more careful 
studies of just where the regularities that «e observe change and just 


*VOKeyJr A Theory of Cr I cal Ekci ons. Jour al o/ Pol i cs 17 (\955) 
PP IIS {’hip Convene Sue ey Research and Ibc Decod ns or Patterns m 
EcoIoB cal Data Chapter 17 in ih* volume Duncan MacRae Jr andJamesA 
Mcldrum Factor Ana/ys s of Aggregate Voi ttg Slat sites Chapter 18 in in s 
volume 
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vihere the thresholds come We may need, in this respect, a better theory 
of national and supranational integration, the first five existential state- 
ments of which I gave examples were all taken from this area In 
another area, we may need a better theory of the limits of privileges — 
social, political, and economic — and the limits of equalization We 
have onl> a very rough knowledge that there is some extent of income 
equality that seems essential for maintaining matenal incentives for 
efforts, and we know rather vaguely that there are, on the other hand, 
some limits of inequality be>ond which “excessive” inequality tends to 
destroy social cohesion and the capacity of co operation in a society 
But we do not have a very good theory for this We have bits and pieces, 
and we have arguments about them, but there has not been any coherent 
theory of the balance betvieen the incentive function of income equaliza- 
tion, whether in collectivislic economics or in so-called private enter 
prise economies 

We also lack a theory of the future of violence in relation to rising 
income levels All past history deals with situations where most incomes 
were so low that the marginal utility of an increment in income was 
high enough to offer remrorcement for the learning of violent habits, 
or of violent action for or against change We do not know m practice, 
but wc must assume, in theory, that there could be income levels where 
mmhi cdditional increment in economic income 

nm or violence We do 

some ihcom “Peace level’ of income is, but wc have 
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Uttov Tr cMnoT,” 7 ‘’f ■Iccl.n.ng marginal 

r 7L ihat n-7 r 1 """ 'ho PO ®oe level of 

nufw?sh7 fP' ’'">'0^ "osht still 

shall therefore reach a s'lluaton dlSernm f'’' 

known “ O'fierent from any other we have ever 

to the 1930 s. wta ’'wran'Mr'rar m"* ^ 

S700 to 51.500 in modtm income developed from about 

wcreaeeomZ.edb7l74T„“r,r 

and from the 1950 s on uh Wc know that nowadays, 

52,000 to 53,000 per caoitLArr incomes ranged between 

poo capita. American labor disputes were being carried 

“Nation and World" 

McOrav. n,il, i<Xi7) 3^227 Political Philosophy (New York 



METHODOLOGICAL PROBLEMS 


27 


on without loss of life, even in the New York newspaper industry 
There may be very prolonged strikes but there is no longer loss of life 
Wage increases are no longer a killing matter The only economically 
tinged disputes m which lives are lost in the I960 s occur m the slums 
of our Negro minority, we know that the Negro per capita income m 
the mid 1960’s a\eraged 60 per cent of the white income, that is, m 
the Si, 500 to 51,800 range, which seems ivell below the peace level 
range 

If ue add a substantnl safety factor we could therefore predict that 
quite conceivably the economic reinforcement of violence might disap- 
pear in the world and in international relations if we could increase 
world income to 54,800 per capita which is eight times the present 
5600 average world income per capita This would require an eightfold 
increase of the per capita income of the world and can be attained 
within about seventy years by means of a 3 per cent growth rate of per 
capita income, which was the average income growth rate per capita 
for 68 countries in the 1950 s, and which is similar to the past per 
capita growth rale of between 2 and 3 per cent which wc hnse known 
in recent decades That is we could reach it m just about the time 
that the world population growth would increase fourfold, which is 
just about the limit we can handle on this planet more or less conve- 
niently 

To be sure, world peace probably will also require a substantial 
reduction of the present extreme degree of inequality of incomes among 
countries as well as within many of (hem Such income inequaUties 
will be less strongly defended, however, as average incomes rise, thus 
resistance in the United States and m many other advanced countries 
declined as average incomes rose 

In other words, we find (hat we get projections that though they 
seem fantastic from one point of view, are very much m line with what 
we have found to be manageable based on the experience of modem 
technologies in the last fifteen or twenty years Such considerations also 
can give us a new approach to the old question of the persistence and 
the limits of class politics in highly developed countries Additional 
relevant data and projections on these problems could be obtained on 
the inequality issue and on the job structure issue Thus we can use our 
quantitatne data to increase our ability to recognize crucial variables 


’’The average per capita growth rale for 68 countries in 1950-1960 was 2 99 ivr 
cent ihe median rate was 2 55 Russettera/. HW</ 

earlier longJcrm growih rates, sec Simon Kumets Modern T 

/Jure SrrMiuK and Spread (New Haven Yale University Press. 1966). pp 64-71 
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and strategic points that may lead us into \ery large theoretical issues 
We need perhaps e\en more badly todeselop askill as social scientists 
that wiW giNC us greater powers of recognition in regard to numbers 
An> mother of a familj in an advanced Fahrenheit-oriented country 


knows the difference between 99 and 101 degrees in her child’s tem- 
perature (or between 37 and 41 degrees ma centigrade oriented country) 
Every long surviving automobile driver knows the difference between 
driving 40 miles per hour and 70 miles per hour But very few social 
saentisls know the meaning of the difference between an assimilation 
rate of one tenth of one per cent a year and one half of one per cent a 
year, or between the social mobilization rate of one tenth of one percent 
and one per cent a year We need a generation of social scientists to 
whom percentage ratios of social change, or the difference between the 


upper middle class of 21 per cent of the work force and of 5 per cent of 
the work force, is as intuitively obvious and familiar as miles per hour 
and acceleration rates are to drivers and temperature levels arc to house- 
wives In other words, we need an intuilive familiarity with figures that 
will make them as familiar to the social scientists as sand is to the Arab 
and snow to the Eskimo Such familiarity will lead to a better under- 
standing of subtle changes, and to a greater awareness of where the 
changes and choice points come 

reasons the kind of work done by such social saentists as 
'“<is “5 from the tact.es of better 
toer "'"‘'"a “t'ltnce We find that 
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preshaped sequence of logical aUernatues among divercenl ami 
mutually contradictory hypotheses leading from choice point to choice 
point and eliminating at each choice point hrge classes of alternatives 
so as to narrow down rapidly ihe search for a solution This procedure 
requires that we can devise or discover for each choice point a crucial 
experiment or crucial set of data or observaiions that wjJJ permt us 
at this choice point to makea decision with a high degree of reliability 
To make questions decidable thus requires finding techniques or cm 
pineal evidence that will help us to decide them 
It might also require and this 1 would like to add to John Platt s 
paradigm thataftereverychoicepoint oraftereveryfew choicepotnts 
we get some confirmatory feedback in terms of new existential state- 
ments new observations and discoveries Ou»c possibly vie might also 
require from time to time some indication whether the particular strategy 
or sequence of choice points which we have been following also shows 
some heuristic effectiveness 

The questions 1 would like to pul before us are Which questions 
are decidable^ Why is it that certain questions are decidable and certain 
sciences have been successful in this and others ire noi'> Which prob- 
lems can be encoded in such a manner as to permit their treatment by 
methods of strong inference^ 

So far as I can see the applicability of strong inference methods 
requires at least four necessary conditions 

] AW vinables relevant to the theorv to be tested must be not only 
clearly recognized but del mited with h gh reliability so that we can gel 
statistical universes with sharply bounded limits 

2 Each variable must be relatively uniform so that its distribution 
over the universe concerned is likely to be well represented by samples 
of limited size 

3 Thcrelevantcorrelationsmustbcrclativelycxhaustive that is they 
must account for a large part of the variance to be explained 

4 The relevant correlations must be relatively stable over time so as 
to permit good prediction across all relevant penods These points must 
be elaborated 

With regard to the first condition a question becomes decidable in 
the sense of strong inference if all or most of the members of ihcrclcv'ant 
class of phenomena or vanibles can be clearly recognized and delimited 
by standardized operations and if Ihe presence or absence of an Hem 
or any relevant state of a varwblc can be clearly recognized This 
criterion excludes therefore the r/e/irffftcdtmension of social science — 
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what Gerald Holton has called “perennjal themes’’’^ or others have 
called the “haunted house” doctrines,** in which at least one variable, 
for instance, “economic cause,” has no clearly defined limit If 
we say that every political event has an economic cause, it is very 
hard to find out what would not be an economic cause The universe, 
the statistical ensemble, of socioeconomic causes is so loosely defined 
that we cannot find any reliable frequency distributions Therefore, 
sse cannot make here any statistical statements, and we cannot 
draw strong inferences We need narrower definitions If the variable 
IS to be used m formulating decidable questions, then the presence 
or absence of any crucial item must be clearly established That was 
the first demand 


The second requirement is that of relative uniformity This condition 
requires that the variance of the class of phenomena we arc using must 
be relatively limited, so that a tolerably small sample, a sample small 
enough that we can alTord it with the limited research budget available 
to us, IS truly representative Remember that if the class has very high 
variance, then even a sample with a large number of items in it will not 
be highly representative of the class as a whole Therefore, the more 
uni orm t e c ass, the smaller will be the size of the smallest acceptable 
sample, and the size of samples we can work with is limited by our time 
samples we can afford will necessarily 
Wh and (he ranges of the.r states, about 

"al.d and not tr.v.al We 
hTomoeeln f Phaaantena that are sufficently 
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accessible to them There is, thcrerore, a historic dimension m the 
statistics that we can deal with at any particular time 
The third point is the degree of exhaustiveness of a correlation If 
we want to make a problem decidable by methods of strong inference, 
we shall need variables that will account for such a large part of the 
total variance (hat we can account for most of the variance of the phe- 
nomena relevant to the theory concerned If we find that we have even 
quite good and statistically acceptable data, concerningsomeconditions 
or background factors or determinants which account for 10 per cent 
of the variance but leave 90 per cent m the realm of ignorance or tn the 
realm of pure chance, we cannot pretend to have decided a scientific 
question by methods of strong inference If. on the other hand, we can 
account for two thirds, three fourths, or nine tenths of the variance, 
the claim of strong inference will be considerably more plausible, and 
it will be much better founded This is what I meant by the relative 
exhaustiveness of the relevant data and their correlations We can be 
sure we can never completely exhaust the variance observed tn many 
phenomena, but we should like to be able to account for at least more 
than one half of the variance Once we have reached that level, our 
inferences can begin to become strong 
Fourth, and finally, (he correlations, the probabilities, and the dislri- 
buttons we are dealing with must all not change too erratically during 
the time of observations or during (he time relevant for our inferences 
to be drawn from them The statistics of the behavior of caterpillars 
permit few if any strong inferences about the behavior of builernies 
This may exclude certain phenomena of evolution, rapid change, or 
nonconverging feedback, where the occurrence of certain phenomena 
will increase or decrease (he probability of their own recurrence If 
these changes occur at time scales which are not critical for the observa- 
tions on which the theory is to be based, strong inference should be 
possible, but if they occur at faster rales, it is not 
Wilhm the limits of these four conditions, however, quantitative data 
can be used in aiding strong inference methods to turn philosophical 
problems of men’s social behavior into decidable questions of social 
sciences There arc many examples of this possibility Karl Marx and 
John Stuart Mill assumed that the world was gcUing more international, 
that the world economy and world culture were growing faster than 
national economics and national cultures Werner Sombart assumed the 
opposite, and Sir Norman Angell tn 1940 assumed that the growth was 
at identical rates This question is deadable for particular times and 
places by means of quantitative ewdenoe Jl happens that in the case of 
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Britain which may be typical for some other countries. Marx and Mill 
were right for the middle decades of the nineteenth century, from about 
1830 to 1870 It turns out, however, that for Britain m the period 1871 
to 1914, Sir Norman Angcll’s view was most realistic, and that during 
the period 1914 to the 1960 s, the Sombart theory of the faster growth 
of national economics and of national communities seems to be true for 
Britain, as well as for many other countries (What this tells us for the 
future is, of course, another question ) 

There arc many problems in social science for which we can draw 
strong inferences, at least for periods of half centuries In such cases, 
we can use strong-inference procedures for deciding among alternative 
lines of theory and research at each further step, and we need, therefore, 
relevant empirical data scries over time and across nations 
We also need to a much greater extent a study of existing theories 
We have neglpcted to study the stock of classic major social science 
theories so as to try to recode them into propositions suitable for study 
by strong-inference methods We need a reformulation, then, of these 
existing theories, and of any new or emerging theories, so as to recast 
them into terms testable by quantitative evidence, and we need to lest 
them for their ripeness for treatment by slrong-infcrcncc methods As 
the era of quantitative research and of mathematical analysis is getting 
under way, the era of serious and strong theory construction will be 
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trade as well as for social mobiltt) inside pirticulir countries * 

In this simple scheme, llic rows v-ill show the range of applications 
and hence the gencralilj of each t>pc of model On the other hand, 
the columns will show that a particular substantive problem can be, or 
has been handled by two or several radictllj different mathematical 
models It will thus show the multiplicity of clTLCtivc approaches to 
each problem It would be convenient to put the most frequently used 
mathematical models into our top rows and to list the least frequently 
used models toward the bottom Similarly, in our columns, the prob- 
lems that are most often quantitatively treated these days should be 
put into the left hand columns, and the problems which arc least often 
so treated should go into the right hand columns In this arrangement 
the density, so to speak, or the current salicncy of mathcmalica! ap- 
proaches to social science problems would fall off along the mam 
diagonal In lime, of course, the matrix would become larger, but we 
could still explore it quickly because the most important and relevant 
areas would always be found near the top left-hand corner Wc could 
then work out a reasonably serious survey of all the different models, 
or of most of thern, that arc now being used and of all or most of the 
h 'f'”* “1'° h'lp “S 10 find out. 
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a function of the facts of life, wc could try to make our computing 
programs and our centers bilingual B> degrees uc could tr> to put all 
computer routines into two formats One would be a private format 
for the men who invented it their private language, their cherished 
conventions just as they like it and feel most comfortable with it The 
second would require extra budgetary appropriations from the science 
foundations and from the computing centers for hiring translators and 
for putting into a public format any routine that has turned out to be 
reasonably important We might have general rules, supplemented by 
screening committees of experts, to decide which program routines arc 
worth this expenditure To some extent, of course, the University of 
Michigan is doing this with survey data when a deck is cleaned up and 
double punches and other abominations which make it hard to use the 
deck again are taken out But this should be done for all our computer 
routines which arc of any great interest, so that we can have public 
formats that can, ideally, be exchanged around the world 


The Growth of Secouctar} Data Budgciins of Ftmcis and Time 
We need, further, a series of decisions based on a belter awareness 
of the point which our data libraries will reach when they will come to 
contain more secondary data than primary data Most of our informa- 
contained, not in the raw data but m the 
(. t cm Ratios and rales of change are of as much, and 
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number of people holding a prejudice has been cut m It is my 

guess that, m modern society, it is about 25 years I am thinking, in the 
field of American politics, about the dwindling of the opposition to 
the St Lawrence Seaway and of the opposition to equal rights and 
integration for Negroes But this is, as you sec, a desperately rough 
guess Our survey research specialists could find much better figures on 
the period it takes to reduce a substantial prejudice and under what 
conditions it is reduced What, here, arc m each case the critical areas 
and proportions'’ And what are the relevant signal-to-noise ratios and 
their equivalents m social processes'’ 


Ctimula!i\e Effects of Visibility and Speed of Intellectual 
Communication 

Finally, as the last point, what can we do to make more of our data 
visually perceptible'’ This is not a matter of showmanship, nor is it a 
minor matter Hayward Alkcr succeeded, as many others have, m 
making visible certain relations in the voting of nations m the United 
Nations Assembly by means of factor and proximity analysis By show* 
ing clusters of issues or factors in two dimensional space, he makes it 
possible for people to see at once something of what goes on This is 
V* y lattei Dogan’s work on making changes in Communist regions 
Visible IS not trivial 
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Data gathering data processing data transmission — if mc improve 
their methods even to a small degree — can add new poasers to this 
collective thinking of mankind In the field of quantitative ccolog> 
that is m the study of the interplay of man and his environment in 
space and time and at ibe level of individuals groups nations and the 
international system we can apply these methods and search for new 
and more powerful ones In our age of unprecedented dancers these 
new powers are urgently needed 
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ecological and individual correlations clearly differ.^ Ecological 
correlations may tell something about territorial units, which can be 
used as contextual properties explaining the variations in the correlated 
variables Scheuch’s formulation, that one cannot argue in principle 
against the statement that individual and ecological correlations usually 
will not coincide but that one cannot rest with such an observation, 
appears as an authoritative assertion of the present-day position Not 
only are territorial units important in their own right but there are also 
methods for estimating the size of error when ecological data are used 
for inferences about individual units ^ 


DELIBERATE USES OF THE “ECOLOGICAL FALLACY” 

Considering the actual practices in social science, one is tempted to 
defend not only ecological correlations but also what could be labeled 
the “deliberate use of the ecological fallacy” as well Even if it is 
assumed that ecological correlations are seldom the same as their 
corresponding individual correlations, one may ask whether demon" 
stration of individual correlations is the prime objective in sociological 
and political studies Certainly not* The mam objective is causal ex- 
planations, not demonstrations of correlations 
Esen taking into consideration the serious and promising work that 
IS going on today m finding precise criteria and procedures for causal 
in ercnccs in nonexpcnmental research, one may safely say that most 
causa exp anations in today s social science rest heavily on some form 
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rather unreasonable to maintain that ecological research should be 
restricted to the demonstration of ecological correlations However, 
we may go further than that In a survey analysis, m which the analyst 
IS making statements about the behavior of individuals on the basis of 
variables denoting individual properties, the force and power of the 
analyst’s presentation will rest very much on his skill m making the 
causal interpretation, in other words, his description of the intervening 
mechanisms In survey analysis — as in other nonexperimental work — 
there will be not only an apparent but also a strategically important 
speculative or interpretive element Causal interpretations will always be 
hypothetical, and we may say that m making causal interpretations it 
is not a question of valid demonstration but rather a matter of present- 
ing fruitful hypotheses As an old rule says, the way of invention 
(finding hypotheses) is definitely something other than the way of 
demonstration In making inventions or finding hypotheses the re- 
searcher may use whatever he has in his power — of course, with some 
ethical limitations — in order to proceed Turning to “the deliberate 
use of the ecological fallacy,” one may contend that the analysis of 
ecological data and correlations may indeed for some researchers in 


some circumstances be a powerful tool in making statements about 
individual behavior It may be that the ecological data facilitate fruitful 
causal interpretations belter than the corresponding individual data, »r 
such are available Now u may be argued that in such a case ecological 
correlalions have actually been used as subsl, lutes for indisrdual cor- 
tn this fashion, but this hind of 
researrh Certainly introduces strong restrictions on actual 
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presenting or defending methodologically loose research pnctices*^ The 
present author s answer would be that it is often important to analyze 
liberal mindedly the prevailing research practices and to ask why they 
prevail The deliberate users of ecological correlations for making 
conclusions about individuals may have had some other scientific 
criteria in mind than did the di^ussants of (he quantitative ecological 
method The former would probably stress something that for the time 
being could be labeled fruitfulness white the latter focus onthcqucstion 
of how safe and probable is the evidence The contention here is that 
discussions on quantitative ecological analysis should not only deal with 
the problem of what inferences are allowed on the basis of aggregate 
data but should also be forcefully directed toward the question on the 
fruitfulness of ecological data 

This discussion Ins a rather old fashioned flavor The real issue in 
methodological discussions of today does not seem to be the ecological 
fallacy but rather the techniques used m analyzing data from many 
levels of social organization It has also been shown, for instance by 
Schcuch. that there are also individualistic fallacies that is, invalid 
inferences about territorial units on the basis of individual data,* 
and a systematic treatment of possible fallacies has been presented by 
Alker ’ Nevertheless the discussions in terms of fallacies has tended to 
divert the attention from asking questions about fruitfulness Of course, 
the question of evidence is central, but in the following section we shall 
discuss problems in terms of fruitfulness 


DIFFERENT FOCUSES FOR ANALYSES OF SOCIAL 
SiSTEMS 

In order to proceed we require a simple classification about types of 
analysis that differ according to the level of a social and political system 

they refer to Instead of dealing mamly with ecological data wc shall 
consider the more general case of grouped or aggregate data A simpli 
fled classification has been presented by Matilda White Riley, who 
distinguishes between four types of analysis * 

1 The first type is group analysis m which the data deal entirely with 
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properties of the groups and aggregates and the conclusions also focus 
on the group As an example Riley mentions Kingsley Davis and Hilda 
Golden’s result, according to which there is a negative correlation 
between agnculturahsm and urbanization in different countries of the 
world 


2 In structural analysis the data and conclusions also focus on the 
group, but individual variables are used either for specifying the result or 
for checking its validity Riley's example is taken from Fans and 
Dunham's famous study of the relation between mental illness and 
socialdisorganization In order to check that their main finding concern- 
ing the relationship between rates of menial illness and patterns of 
social disorganization in different zones of the city was valid. Fans and 
Dunham compared segments or groups of the several zones in which 
individuals were alike with respect to an individual characteristic 


3 Another type is contextual analysis, which differs from structural 
anal>sts m that the mam focus is on individuals, but individual relation- 
ships are checked or specified by accounting for differences between 
the gro^s to which the individuals belong An example is provided by 
James Davids’ examination of the relationship between the degree of 
activity and the tendency to drop out of the group Here the role 
NO umc or the proportion of group members who are active role 
players appears to be an important contextual characteristic 
in individual analysis, of which there is. of course, 

We have learned in voting studies that, for 
rosiiivel'v media and political activity tend to be 
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very unfair, but an inspection of the examples presented seems to 
support the contention that structural anaijsis and contextual analysis 
are more fruitful In the group analysis case of the negative correlation 
between agncuUurahsm and urbanizaUott one has at least a feeling that 
there is a tautological element in the finding This feeling is c\cn stronger 
in the examples selected from voting surveys To be exposed to mass 
media is already a symptom of political activity, and to be an early 
decider during an election campaign is, if anything a symptom of 
political interest At least the examples from both the group and the 
individual analysis appear as decidedly trivial compared to the examples 
from structural and contextual analysis 
Many researchers have felt strongly for some time that the most 
fertile uses of ecological or generally aggregate data are those m which it 
is possible to combine both aggregate and individual data ’ This belief 
has been based mostly on intuition or on the experience of simultane* 
ously working with method and theory In the follow mg section I shall 
attempt to show why the combination of aggregate and individual data 
IS particularly fruitful 

THE INFORMATiyE VALUE OF AGGREGATE DATA 
The concept of fruitfulness has hitherto not been specified, and here, 
too, It will be exchanged for another concept, namely that of informa- 
tive value Among contemporary sociologists few have given so much 
explicit attention to the question of informative value as has Hans 
Zetterberg, who also asserts that sociologists have given much less 
attention to the question of informative value than to the question of 
evidence ® The notion of informative value has been put forward 
particularly by Karl Popper, who also speaks of the empirical or 
informative content of scientific statements The empirical content 
of a statement increases with its degree of falsifiabiJity ’ the larger 
the number of ways in which a statement can be proved false the 
greater its empirical content (the higberits informative value) Formu- 
tsdftg it cfi 3 moac ft'sy, out could also say that the jrreater 

variety of events a statement accounts for the higher its informative 

value According to Popper, “science does not aim, primarily, at high 
probabilities It aims at high informative content, well backed by 
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experience But a h>pothcsts nn> be \cry probable because it tells 
us nothing or very little’ 

Statements with low informative value arc what we call trivial, and, 
with regard to the empirical content, the most trivial of all statements 
are, of course, the tautological ones Triviality is a matter of degree One 
sometimes has the feeling that many sociological and political science 
statements are close to tautologies without being so completely The 
earlier mentioned result from voting surveys, according to which those 
who arc more interested in politics make their decision to vole earlier 
than those who are less interested in politics, is an example of such a 
statement It is close to being a tautology because early decisions may on 
the theoretical level very well be seen as a symptom of political interest, 
and in many studies one may actually use the time of the voting decision 
as an indicator or an operational definition of political aclivily. Never- 
statements are often presented as empirical findings, and 
the analyst would, if he is questioned, usually reply that the time of the 
vo ing ecision and political interest are operationally defined in- 
epen enl y of each other This is a standard reply, for example, when 
arc questioned about the triviality of their results 
of mformativc content cannot be discussed 
thorouoH of operational definitions Any kind of 
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gencralfzation can be falsified is indeed small, or. as also ctn be sard the 
number of polenljal falsifiers is very smiK 
It IS imporianl to note that the narrow scope of such dispositional 
generalization leads also to a sort of arbitrariness or instability m the 
use of operational definitions If one operational definition of the 
generalizing term such as the attendance at a political meeting, fails or 
does not confirm the gcncraJizaiion. it is easy to switch to another 
operational definition, such as the rending of political mformation, and 
hope for better luck When a generalization is scry narrow in scope and 
accordingly not too tied to other and difTerenf phenomena, there are 
actually no guides and fvw restrictions on how operational definitions 
should be chosen We may conclude that if in a generalization the 
generalizing term and the predicate concern very similar phenomena, 
there IS both a great likelihood for arbitrary adjustments of the opera- 
tional definittons and very few potential falsifiers Already this con- 
tention speaks for the value of generalizations containing variables 
from different levels, for instance, variables from both ihe group and 
the individual level The use of varnbles from different aggregate 
levels uill decrease (he likelihood that (hey concern the same type 
of phenomena and will increase (he likelihood of results of high 
informative value 

The ease can be put more strongly by considering the procedure of 
specification There is generally a feeling that iwo-variaWe relationships 
seldom are interesting in sociology and politics There are. of course, 
also good grounds for such a feeling If generalizations are to be 
theoretically relevant, they must be specified, that is their conditions of 
validity must be assessed In the elaboration of social science data 
specification usually means that (he generalizations are formulated in 
terms of at least three v.ariables In general, specification tends both to 
increase the number of potential falsifiers and also to eliminate arbi- 
trariness in the choice of operational definitions, since the specifiers 
indicate the potential connections of the operational definitions with 
other phenomena tf we could specify our generalization by saying 
that “the more people with a high education are interested in politics, 
the more they tend to delay (heir voting decision, whereas the more 
people with a low education arc interested in politics, the earlier they 
tend to make (heir voting decision ” vve have certainly addled to the 
empirical content and increased the number of potential falsifiers Here 
we may return to Riley's four types of analysis and say that already the 
fruitfulness of specification is one reason why structural or re aiional 
Mai, MS « often more .ateresting than simple groop or individaal 
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analysis Structural and contextual analyses involve in practice a 
systematic specification 

Structural and contextual analyses, however, also dilTer from group 
and individual analyses in that they involve variables from both group 
and individual levels The fruitfulness of specification rests, of course, 
on the nature of the specifiers used If the body of variables (theory) 
from which the specifier has been selected concerns the same type of 
phenomena as the generalization to be specified, there is a danger that 
the specification will not increase the degree of falsifiabihty There is 
likewise the risk that there will be much arbitrariness in selecting 


operational definitions If the body of variables from which the specifier 
has been selected deal with clearly different subject matter than the 
generalizations to be specified, the arbitrary adjustment of operational 
definitions becomes difficult Of course, for a specifier to be fruitful it 
also has to be related via theory or generalizations to other phenomena 


on Its own aggregate level In any case, if the specifier and the general- 
ization to be specified are from difierent aggregate levels, there is great 
likelihood that they will deal with clearly different subject matter and 
also great likelihood that the specified generalization will be of high 
informative value This is also why generalizations in psychology 
onen appear particularly inlcrcsting when they contain references to 
cither physiological or sociological theory One cannot, of course, rule 
ou the fact that specifications can be interesting even when the specifier 
and the generalization arc of the same aggregate level They can be so, 
provided that they really deal wuh sufficiently different subject matter 
crossing different aggregate levels either 
tool m or contextual specification is an extremely powerful 

tool m aiming at gcncral.zaiions wuh high informative content 


The mam conlcnlmns may be summed up a few short statements 
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or the ecological fallacy may be taken as an indiciljon of stnvjnss for 
results of high informative value through speculations and interprcta 
tions There exist deliberate and cold blooded uses of the ecological 
fallacy 

3 When theinfomjatjvevalueofastatcmenijsdefinedbytheniimber 

of uaj s it can be proved false it appears that generalizations containing 
variables dealing with entirely different subject matters or chosen from 
different thcorelical systems arc most likely to have high informative 
value Generalizations containing variables from different levels of a 
social system (different aggregate levels) are likely to have high in 
formative content and also likely to restrict arfailrarincss in the selection 
of operational definitions On the other hand generalizations con 
taming variables that all concern the same type of phenomena are 
usually trivial 

4 When generalizations arc specified the most fruitful specifications 
are those in which the specifying variable is chosen from a clearly 
different area from the generalization to be specified Again the sclcc 
tton of variables from different aggregate levels serves as a powerful 
safeguard against both triviality and arbitrariness with operational 
definitions 

5 For the reasons given earlier structural analysis (the generalization 
IS on the group level but the specifier on the individual level) and con 
textual analysis (the generalization is on the individual level but the 
specifier on the group level) will be extremely useful tools in arriving 
at generalizations with high informative content 

6 Of course it cannot be asvcried that all generalizations containing 
variables from the same aggregate level arc trivial Triviality however 
IS a matter of degree Structural and contextual generalizations tend as a 
rule to be less trivial than generalizations based either entirely on 
grouped or entirely on individual data 
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Individual and Structural Effects in 
Ecological Research 

Onc approach to the methodotogical problems of ecological research 
is to ask What kinds of inferences arc perm<5$ih\^ when usingaggregate 
data‘ or, more specifically, how can inferences about the behavior of 
individuals be made on the basis of such dafa?^ This paper also deals 
with these kinds of methodological problems The point of departure is 
not, however, the nature of data generally used in ecological research 
To solve problems associated with the use of ecological data it seems 
necessary to examine the sociological meaning of areal units and the 
kinds of efiecis their properties may have on the behavior of individuals 
Ecological analysis can be earned out fay faking the areal units as 
wholes in their own right ^ Jn most cases, however, the units are col- 
lectives, the properties of which are obtained by aggregating properties 
of smaller units (e g , of individuals), and if one wants to make causal 
interpretations of the observed relationships among variables, the 
multiple-level nature of the phenomena must be taken into account 


’ Sec Erwiti Sthcuch. Cross-Naiional Comparisons Using Aggregate Data 
Some Substantive and MetnodotoBical problems, R MernttandS Rokkan teds ). 
Ccimporm;? A'fl/fU/is (New Haven Vale University Press. 1965) pp 148-167 

* See, c E , Leo A Goodman, * Some ARcmaiivcs to Ecological Corrcfalion," 
/Imertc/irt Journal n/Socioloj!},64i\9S9),pp 610-625,0 D Duncan R P Cutzorl, 
and B Duncan, SraHsiirnl Cep^roph} (Glencoe Free Press 1961), pp 68-80, and 
Raymond Bouden, ‘•propricics rndtiMtucIlcs el propriilfs colicclives un problimc 
d’analyse dcologrque,” Feme fraaeaise de soeiohgie 4 (1963). pp. 275-299 

* Cr Paul F Lazarsfild and Herbert Menzef. ‘ Oo i?w Rclafwn befvteen lndi\ idual 
and Collective Properties, ‘ in A Etzioni (cd J, Complex Orgamsations (New York 
Holt, Rinehart, & Winston. 1961). p 425 
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The situation is then a special variant oT the type of sociological 
research in which there are units and variables from many levels of 
social organization Although much confusion over the conceptual 
definitions of the levels and their relationships still exists and few 
appropriate research methods have been worked out,* an attempt will 
be made here to examine ecolo^cal research as a type of multiple level 
research 

To make the matter concrete, let us cite an example m which we are 
studying the modernity of farming practices in a rural area The area 
can be divided into smaller areas, for instance, villages An average 
describing the modernity of all the farmers is calculated for every 
village These averages might be means of some kind of modernity 
scores of farmers or percentages showing the proportion of farmers who 
have adopted a given farming practice The question is What is the 
sociological meaning of the differences in modernity among villages, 
and what kinds of causal processes can give rise to the differences'^ 


SOMC MODELS EXPLAINING ECOLOGICAL VARIATION 

First, there is the possibility that the differences are random If the 
variance m individual modernity scores for the whole population and 
the sizes (number of farmers) of the areas are known, the expected 
variance of the averages of the areas, that is, the expected ecological 
variance, can be calculated, assuming the differences are random * The 
areal units used in ecological research arc usually so large that the 
averages of the areas would be practically equal Sociologically, that 
means t at no properties of the areas and no individual variables with 
nonrandom w^ogical variance would have an effect on the behavior 
^ ecological research, such a 

Uninteresting Such sociological variables whose 
howcier '’‘Plain'd by random factors are, 

expect to practice In psychological research one could 

ex^t to find such variables more often 

stati*ticarxen°xp”fn other significantly in a 

, c siaiplesl explanation would be that an individual- 

llixct tt V.oll'd’' IsS'p? M2 73?°"* “ dwroort (New York Horcourt, 

*1l-<fCK ho^kc^cr the '*■^5' 2'^'ys's of 

« analotmn lo “»puf,ou^ "onexislcncc of significant differences 

t»o tl m naic each other '"'“r'ey research ihc opposite effects of 
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Ic\cl variable x, which has ecological variance, has an effect on the 
dependent variable j Jr> is the modcrnjiy of farmers v could be, for 
example, education, if the inhabitants of the villages differ m average 
education In this case, y would also have nonrandom ecological 
varnnee because of the individual lc\cl relationship If no other 
ecologically relevant mdcpcndeni varjabJe has influence upon the 
slope of regression of the > averages on x av-erages is equal to the slope 
of regression of the respective individual variables In addition to the 
individual level slope calculated for the whole population, slopes can be 
calculated separately wiihm each area All these slopes should be at 
least approximately equal if the assumption of only one ecologically 
relevant effect is valid Tlic ecological correlation and the individual 
correlation tn the whole popuhtton or within areas arc not equal The 
size of the difference is dependent on the ratios of the ecological 
variances of the two vambles to the individual variances The correla 
(ion coclhcicnts within different areas are also unequal, if (he within area 
vanmees of r arc not equal ’ 

The case m which the ecological correlation is only a refiection of an 
individual level relationship has received special aiieniion because it is 
possible to approxfiTTaCc (he rrtdivtdual correlation by means of ecologr 
cal data using the equality of individual and ecological slopes or their 
analogies ^ If the ecological variation of > can be explained by the 
ecological variation of x, a new question may arise How then can the 
ecological variation of x be explained ’The explanation could be found 
in new individual variables that have ccologicil variance Howev'er, m 
some phase of the causal chain there apparently must be a global type 
variable characterizing the areas as wholes and explaining the differences 
in the compositions of the populations of the areas 

Jn most cases if is not realistic to assume that theecological variation 
of j can be explained by only one independent individual variable In 
practice usually there would be several ecologically relevant independent 
variables The situation is however, m principle the same as with one 
independent variable (he ecological relationships are only reflections 
of individual level causal processes From the point of view of the 


’These conclusions are based on HuberiM BWock Jr Cause/ M 

Nonexpenmenta! Research (Chapef II.H Univcrs.lyof Norih CaroJma Pms, I96J> 
pp 95-126 and Duncan efo/ ap at pp 6»-80 especially ihe equation on page 
66 = 6 +C’.j, (bt - b») v>hen « rbe mdmdual slope for Ihe who c 

poputalion‘°A„isfheavcrageiviibmflreas1c^ A, is the ecological slope and E ,a is 

ifTCCorrelaiionraiiof'nbeindcpendenivanaHexonitKarca 

• Sec Goodman op cU and Boudon nr 
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sociological meaning of ecological units, these kinds of processes are 
not very interesting In the explanation of the phenomenon under 
study no proper ecological effects have to be taken into account. 
Situations in which the properties of areas affect the behavior and 
properties of individuals are ecologically more interesting 
Contextual analysis has been the approach by which the effects of 
thepropertiesor groups of collectives on individuals have been studied 
In the methodological discussions especially a variant of contextual 
analysis appears to have been given considerable attention, namely the 
variant dealing with so-called structural effects’® or compositional 
effects ” The properties describing the collectives in this kind of 
analysis are analytical variables formed from the properties of members 
of the collectives The variables, m addition, usually also have purely 
individual-level effects on the dependent variable 
Davis, Spaeth, and Huson have made an attempt to classify different 
types of compositional effects, mainly according to their mathematical 
forms’* The first criterion is the linearity or nonlinearity of the 
compositional effect The second classification criterion is whether 
the compositional property of the collective has a direct effect on the 
individual-level dependent variable or on the relationship between 
the dependent and independent individual-level variables The direct- 
effect models of Davis, Spaeth, and Huson can be formalized by the 
following regression equation 


>n ^ o + bix, -j- bjXtf (1) 

Here tu IS the expected value of the dependent variable of the in- 
Mual I hvmg m the area; and having the value x„ of the independent 
n h '’/t the independent variable in the areay and 

n. bi, and 6. are constants ■> If,, ,s modernity of farming practices, the 


Quonerli^ii of Opinion’ Sludics,” Public Opuiian 

anal>-s.j as a partial fom of considers contextual 

analysis more general approach which he calls soaal system 

PP 178 - 193 ,' and'ArSld's’?*^“J*’’ Soaohgica! Eeuevt 25 (19(C) 

«rsus Individual Effects ~ G Bachman, ‘Structural 

"James A Davis. Joe 69 (1964). pp 585-595 

Che Effects ofGrouo^m^. "ATechniquefor Analy- 

PP 215-225 PComposinon. American Sociological EeMCH, 26 (1961). 

218-222. 
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indi\iduaWe\el Nanance of il m the whole population would be 
explained, according to the model, by the education of the individuals 
(bi =' 0) and. additionally, by the average cduaition in the areas 
(ht 0) If 6, and />, have the same sign, as seems natural, both 
indjwdual-level education and ecofogicaf.fevef education have an 
clTecJjntbcsamc direction If^iandA^havedifrercntsigns.thevanables 
on the two levels have opposite efTccls Davjs. Spaeth, and Huson'^ 
give an example of this seemingly improbable situation 
In addition to the complete models altcad> mentioned, the equation 
can describe three incomplete models corresponding to three types of 
cflccts dealt With by Davis Spacib and Huson ifA, = 0 and bt = 0, 
there IS no nonrandom cflect at all if/i, ®0\vhileZ>i /-O, there is only 
an mdividual-lcvel effect, and if 6, 0 and bj - 0. there is only a 

composiliona) effect In the last case, education would affect the 
behavior of individuals only through the average education of the areas 
As was the case with the reflections of individual level relationships, 
it IS probably unrealistic to assume that just one variable would have 
a Compositional efTcct on the behavior under study Therefore, several 
{ndependent variables must again be taken into account Some of them 
may have cITccts at both the ecological and the individual level, and 
some at only one or the other 

A modcf in which (he ecological variable, instead of having a direct 
effect on the dependent variable, has an effect on the relationship 
between the independent and the dependent variable can be represented 
formally as follows 

The individual-levef slopes of regression ofy on x are not equal in all 
areas, as was assunved in the preceding case The steepness of the slope 
IS now a linear function of the area averages of the independent variable 
If, for example, </ were negative, the education of a farmer would not 
have as strong an clTcct on his modernity m an area of high average 
education as m an area oflow aiera^educatfon The most usual case 
of this kind of interaction is probably '‘weak" interaction, where the 
sign of the slope remains the same m all areas ft is possible, however, 
that the slope is near to zero at cither end of the ecological variable or 
that It actually has dilTcrent signs within different areas 
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In the model represented by Equations 2, the variable afTecting the 
slope h, IS the ecological counterpart of the variable that explains } at 
the individual level Other ecological variables, too, can haveefiectson 
the slope In addition, the strengths of the cfTects of a number of in- 
dividual level variables may be functions of contextual variables 
It has been assumed, up to this point, that all effects are linear A 
large number of new models can be formed if nonlinear relationships 
arc allowed Davis, Spaeth, and Huson mention some examples, in 
which the effects of group level variables on individual-level variables 
are nonlinear, and it is easy to construct other types of nonlinear 
models 


In all the preceding cases the ecological variables have been measured 
analytically and thus have a counterpart at the individual level The 
situation does not change formally if the ecological variables arc global 
measures A global variable can also, for example, have an effect cither 
on the individual level variable or on the relationships between in- 
dividual-level variables In the discussion of structural or compositional 
effects, only analytical group level variables are usually considered 
This IS apparently due to the fact that only the existence of the same 
variables on both levels has posed special problems of analysis The 
distmction between analytical and global variables may be, however, 
useful for the interpretation of the results 
The effect have been introduced here by means of regression models 
‘"dividuals only formally, without discussing 
Hsc to the cffccts Knowlcdgc of these 
ecological research When the models are 
be verv a f’^f’hnear effects and interactions, it may, however, 

t Only the simplest models Will 

i>c examined next from this point of view 


SOCIOLOGICAL i^TERPRETATiONS OF STRUCTURAL 

effects 

namely = o ^ l*^^*'^**'^* structural effect mentioned before 
process that can eivVn«* 

"here the average education following in the area; 

norms are more niodc“ th " ^ 

the value s>siem and nnme ». of low average education, an( 

independent of their own the behavior of individual 

m .he majorura in areas where worl^er 

™y. a polmcal cLmal," favormg Icfust vol.ng ma; 



INDfVroUAL AND STRUCTURAL EFFECTS 


59 


exist Concepts like mine systems, norms, or climates describe areas 
as wholes and are "conceptually global • properties of areas ll 
IS not meaningful to speak of value systems or norms if they 
exist only internalized by mdisiduals In tim kind of explanation 
an analytical measure is thus used to measure a conceptually global 
property 

The existence of structural effect can be interpreted m another Wsiy, 
too Campbell and ^Mexander give an exTmple of it In studies of 
high-schoQ[ students, a structural effect has been found the larger the 
proportion of middle class students in a high school, the greater the 
likelihood that students of a given socioeconomic stratum have high 
educational aspirations This result can now be interpreted by means of 
differing value systems m the schools the average socioeconomic status 
of its students determines the value system of a school, and the students 
orient themselves toward the expectations constituted by the school- 
wide value sjstem Campbell and Alexander howeier, show that the 
structural effect can be explained without using variables tint charac- 
terize the whole collectivity They assume a two step causal process 
The first step is based on the social psychological theory presented, for 
example, by Festinger and Homans, according to which persons feeling 
attracljon to one another arc likely to become similar in their values 
and behavior As the second step of the explanation it is assumed that tn 
every school the friendship relations of the individuals are distributed 
randomly, without regard to socioeconomic status The fnerds of 
students m schools having high average socioeconomic status are then, 
on the average, from higher strata than the friends of students in other 
schools As friends are likely, because of the social psychological process 
mentioned before, to become similar to each other, it follows that the 
average socioeconomic status of the school has an effect independent of 
the properties of the individuals When in the empirical study by 
Campbell and Alexander the average socioeconomic status of (he two 
closest friends of high school students was controlled, the differences in 
the socioeconomic status of schools no longer explained ihe differences 
in educational aspirations of the students The status offnends was, cm 
the other hand, clearly related to the status of the school 
Tlie article by Campbell and Alexander has been quoted rather 


“ Tanwnbaum and Bachman op ett p 594 emphasize ihisdislmclion using the 
terms * siruciural concepl and slruciurat measure , j 

‘’Erncsl Q Campbell and C Norman Atewnder Structural EfTccts and 

Interpersonal Relauonships American Jounat of Sociology 77(1965) pp 284-389 

»• Ib/d, p 287 
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extensively because the “contagion” or learning cfTect*’ it deals with 
seems to be highl> relevant to research using areal units People living 
near each other have many contacts and mutual friendship relations 
Similarity of behavior and attitudes is to be expected as a result It can 
be assumed that some learning or contagion will take place without 
special attractions between people, if only because of daily interaction If 
the contagion process is usual, as seems likely, a structural eficcl 
caused by interaction and friendship relations will always be found 
when the ecological variance of a variable having an individual level 
effect is not small 

A consequence of the contagion model with several independent 
individual level variables can be stated as follows If there arc data on 
the composition of areas by individual level variables and on the 
relationships among the variables at the individual level, it is possible to 
calculate, assuming no structural effects, the expected ecological value 
of the dependent variable m each area The slope of regression 
of the observed ecological values on the expected values is then 
greater than one The observed values thus are greater than the 
expected ones in areas where the expected values arc above the average 
or all areas and lower in areas where the expected values are below 
the average 


possible to make the distinction between 
in nrJ-'t ^ ir **tc cfTects of interpersonal contacts, it is 
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■•pure” global efTects These lands orefrects are .ntercslmg and .nrport- 
ant, especially for practical applicalions of sociological knots ledge li^ke 
plannmg purposes If changes in the circumstances 
desired clobai properties oF the arcis atc usually changed first ne\ 
atotSne organizations local chapters of associations, schools 

-sr,::r'ir4-s== 

with It 
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most apparent when one uses units wilhm which the populations arc 
completely intermixed, that is when contacts and friendship relations 
arc random, at least in regard to the phenomena under study In the 
example given by Campbell and Alexander, the units arc schools, and 
there the assumption of random interaction probably is approximately 
valid In ecological research the interaction areas of dilTcrenl in- 
dividuals are not the same but overlapping One can imagine that 
in a rural area, where the population is concentrated in small 
villages isolated from one another, each village might be an area 
within which the majority of the interaction of the inhabitants 
would take place Usually, however, areal units arc considerably 
larger than the areas of interaction of individuals In addition, the 


contacts of individuals are not distributed randomly among the 
people living in the ecological unit, and the contacts arc not limited 
to those living in the same unit It is thus difficult to say how the 
manifestations of the contagion cITecl vary when different kinds of 
areal units are used The cfTccl is probably rather clear when most of 
the interaction of the individuals takes place within ihcir own areas 
and the areas arc small 


When dealing with global effects, the areal units that are most 
rcevant are those to which the global phenomena in question apply 
e e ccts of properties of communal administration on political 
are studied, there arc no alternatives m choosing the analytical 
can of the “best” or “natural ’ unit of an ecological study 

Clohal n ^ examining what types of units have relevant 

?hanl,neleTeT» "’“V be global properties at more 


nonsocioirp^raM^fr^'^^^' ”iadc more complex by the 
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Studied This imv «« . among sociological variables are 

factors can gne rise^*^ ^ fallacious, because nonsociological 
variables ° spurious correlations between sociological 
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STRUCTURAL EFFECTS WITH INDIVIDUAL AND 
AGOREGATE DATA 

In conlexlual analysis, as a rule, dala from both (he inilividu.il and 
the group le\el can be used (n ecological research, on the contrary, the 
most usual type of data are aggregate data — a fact which changes the 
basic situation Someanalytical problems will be examined here, first in 
studies with data from both levels and then in studies \Mth only aegre- 
gate data 

To isolate structural eflecis, Blau used a technique** that can be 
presented by the following example To find out if education has, 
besides an individual cfTect, a structural effect on farmers* modernity, 
the areas studied are divided into two subgroups those of high average 
education and those of low average education Within these subgroups 
the individuals in their turn arc divided into two groups according to 
their education If individuals with low education are more modern in 
areas of high average education than similar individuals in areas of low 
average education and if the same thing holds true for individuals with 
high education, a kind of structural effect has been found, according to 
Blau The technique presented by Davis, Spaeth, and Huson is based on 
the same pnnciplc, but the areas are divided into more than two sub- 
groups 

Tannenbaum and Bachman show, however, that both Blau’s tech- 
nique and the technique of Davis. Spaeth, and Huson are incorrect, and 
at feast some of the structural effects observed are probably not genuine 
The error is caused by the fact that if two classes are used when dividing 
individuals info subgroups, although the independent variable is not 
really dichotomous (like sex) bat quantitative (like education), the 
properties of individuals are not controlled strictly enough ** 

Several partial solutions to this problem have been proposed by 
Tannenbaum and Bachman Some are based on the use of more exact 
classifications and some on correlational methods One of the corre- 
lational methods IS multiple-regression analysis.** which in this paper 
has been used to formalize the models In it, the variance of the mdi- 
vidual-level dependent variable is explained by independent variables, 
some of which are properties of individuals and others analytical or 
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global contextual sanablcs measuring properties of tbc areas As the 
slopes m multiple-regression 4inal>sis arc in a v-ay analogous to partial 
correlations, the existence of structural cITccls is slio\sn if the slopes 
of contextual variables differ from zero When using this technique 
or others based on correlations, one must make sure that all the 
effects arc approximately additisc and linear. If they arc not, more 
complex models can be applied 

Compared to cross tabulation, correlational methods base several 
advantages data can be used more effectively so that more independent 
variables can be taken into account at the same lime, the fallacy shown 


by Tannenbaum and Bachman is avoided, and, relevant to the topic 
of this paper, the models and results will be more easily comparable to 
those of ecological analysis using aggregate data Tlic dichotomous or 
qualitative nature of some of the variables is not an insurmountable 
obstacle to the use of correlational methods, as such variables can be 
taken into the analysis as “dummy*’ variables ’’ If one is content to 
rnake inferences without the aid of exact statistical tests, assumptions 
about the distributions of variables arc not necessary 
When isolating structural effects there is a difficulty analogous to the 
one shown by Tannenbaum and Bachman which cannot, however, be 
0 VC y merely adopting a correlational technique If one would 
1 °^ that the larger the proportion of workers m an 

votincr P^^”"^2gcof both workers and the other inhabitants 

is as liVciu ^'5hl have found a true structural effect U 
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phenomenon ma> be generjJ, cspccMlIy m arcjy where migration is 
considcMblc Tlicsc eases arc interesting in ibur own nghi, but thev arc 
not similar to the structural effects proper 
ScKin and ilogitrom try ttr sohe .mother problem of contextual 
analysis Tins problem is earned b> the confounding of the efTccts of 
sariabfcs correlated with each other (n noncxpenmcntal research the 
collectives cannot be manipulateii in a way that allows the study of the 
cITccts of one collccine proper!) tontrolhng for the others As the 
number of lollcitncs is often small it is dilhcult to carry out the 
cross tabulations needed to sep irate tlic eflctis The solution proposed 
by Selv m and I lagst rom t\ the use of f.Ktor anal)sis Using n and taking 
colleciiv e$ as obsers ations they reduce the numlier of variables to a few 
(m the example, to fisc) Schm .ind Ihigsirom lake into factor analysis 
only analytical or, in their lirminology, aggregative measures Globil 
measures could probablj also be mtluded Sclvm and flagstrom make 
use of the results of factor .mily sit toclissify collectives by means of 
factor scores into four subgroups, which arc tlicn used in conlexlua) 
analysis A more effective way seems to be to use the factor scores of 
collectives as eontexiuil properties of individuals and then to use 
correlational techniques If .in orthogoml rotation is applied, ihef.iclor 
scores wilt be relatively untorrclaicd with each other Tins imkes it 
easier to separate the effects of the factors Instead orfactorscores.asct 
of the original variables could be used to characleriic the collectives 
A variable that Ins a high loading on the factor .md is meaningful m 
other ways is chosen to represent each factor This makes the analysis 
less cumbersome and less abstract 

Like most other sociological research, multiplcdcvcl analysis is 
usually based on noncxpenmcntal research design It is therefore 
exposed to all the usual sources of errors doc to difficulties m deter- 
mining the direction of causal relationships and in controlling for all 
relevant factors The multiple-level approach, however, makes it easier 
to .ivoid two kinds of fallacies of a more theoretical nature, which 
Rdey calN •'psychologistic fallacies’* and “socjologisiic fallacies ’ A 
psjchologistic fallacy is committed if the researcher, when explaining 
individual behavior, looks only at the facts about each separate in- 
dividual, disregarding factors such as the character of the community 
context The danger of a sociologistic fallacy is present ifthe individual- 
level processes are disregarded when one is interpreting relationships 


liinan C Scluii and Warren O Hagvirom "The Empinwl Cla«ificai.on of 
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between variables describing collectives These fallacies arc not 
necessarily due to the use of a certain type of data but are rather 
associated with the researcher’s theoretical frame of reference Although 
working with aggregate data, one can at least try to have regard for 
individual- and collcctne-lcvel relationships, as well as for the relation 
ships between collective and individual variables 


PROBLEMS OF ANALYSIS USING ONLY AGGREGATE DATA 

All the technical pitfalls dealt with previously are, of course, still 
present when one has to be content with aggregate data, as is usually the 
case in ecological research In addition to them, there arc then the 
special difhcultics caused by a lack of information about individual* 
level relationships The best-known fallacy associated with the use of 
aggregate data is the so called ecological or aggregative fallacy 
Since the article by Robinson and the discussion following it, this natvc 
^sumption, according to which the ecological correlation as such could 
^ used as an approximation of the individual correlation, is no longer 
rnmi » 1 . ^ sophisticatcd version of this fallacy is 

theif '!n *1 ’ ecological slope and the individual slope or 

their ^Ios,« are .ncorrectly supposed to be equal 

Of il,f have been defined by Alkcr by means 

Within * ^orem dividing the total individual covariance into 

rallac,esarfro'’e/lf covariances “ Some of the 

of course iheusii'-il ^ using aggregate data One is, 

one IS oertnns fallacy, of the others the most important 

the Onl) possible om- r' mentioned by Alker arc not 

> possible ones, if i, ,aU„ ,n,o consideration how insulTicient 

■ilKtsIlS; t> 158, uses Ihclcrm'inJoiilual 

«lrei,.,iy a, coltais.iy- S' |"“''^'“wf“Inessorancxnlanalionlhallre3lslh' 
'Ky 111 -iroup faiij’ to ihc idea of ps>cho1ogistiC 

^ »J«li1.keihc vxnologtsiic fallacy, although he put* 

fallacy "^‘**‘* "“*^'"8 the group fallacy a generalization of ihc 

Correlations and ihc Behavior of 

‘•T*; PP 351-357. and R.lcy. 

PP 10"* IfV ta 

of stones muSd S' ” ** aPn™P"’" 

' th give m of covariancCT av it n the rcgrcvvion 

*c«ncc (Blalocic op elt p 51) 
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ojprrpicdita arc for dncnbing complex ccologi il phenomena Some 
authors hm however madeaflcmptf at dtfuhrp/ng methods (o draw 
conclusions ami (o nsoicl the pitfiHt Goodmin has presented some 
techniques for Mtimaimg indisidua! comlations on the basis of 
ecoloficnl dtti ustnp both quditatisc and quantilatnc \nnabfcs ” 
In most of the mmlcls trcnied b> him « it assumed that the ecological 
rchtionship is cnl> n tcfletlion of an inUistduantscl relationship m 
ss/iith ease the ccofogic il slope on U. med as an estimate of the m 
dnidual slope Houdon hasdcali with the same t)pcs of problems abo 
et-imminp models contuning structural ertects Boudon presents 
further a general stnteg) for making inferences on the basis of iggrc- 
pile ecological data These tmporijai techniques ire not treated here 
tn detail hut some comments will l»c mule 

Uoudon cximtncs the shapes of the models to determine whether 
the ecological regression of the dependent lanaWe on flic independent 
sanahle is linear or nonlinear and cloiUcs them acctrding!> If the 
regression is linear and certain other conditions nrc met it is inferred 
thU no contextual elTccts arc present and the ecological correlation is 
merely a reflection of the indiMduaUorrelation It seems howeser that 
the conditions presented in ‘idJnion lolinenrit> arc not alwa>ssulTicicnt 
U> sepante all kinds of structural cfTects from the assumed reflection of 
an indiviJui) corrchtion 

One ssiy of making erroneous inferences from the linearit> of an 
ecological regression is nssocnied with the direct strtictunf elTecf (tjpe 
llfc/Tccl by Pans Spacih and lJuson) which docs not influence the 
regression between individual sariablcs and docs not cause statistical 
interaction This model has been prcsentvd as follows (sec Equation I) 


^ e <7 + b,\, + h,T , 

When the area! averages arc calculated the following equation is 
arrived at 

T, e er + bt\, + 6j\, 

Of 

/, = <T + 01 + f, 

It cm be seen that the ccolopcal rcErcssion is linear but the slope is 
sleeper than the tudividual slope (assuming Hint b, and b, have same 

*’ Qootlman ep e/l nntl CeoIoK cal Resresvion and Celia' or onedvidoals 

c™We.lV”te, «(IW rP 6f J-W Thetojen by Duncan e, 

0/ n til pp 6S-80 li based largely on Ihese papers by OoCKlman 

ttoudon op c/r 



68 


TAPAM NALKONES 


signs) Without additional assumptions it is not possible to estimate the 
relatne strengths of individual and structural effects It is sometimes 
possible in these cases to demonstrate the existence of a structural 
effect, as by means of the additional criteria mentioned by Boudon, but 
the problem may not be easy to solve in a general w ay 
Boudon presents a model with dichotomous variables (model B) with 
a structural effect almost similar to the one just treated In his model, 
the ecological regression, however, is not linear This is caused by the 
fact that Boudon uses two parameters to describe the structural effect 
One of them can be interpreted as showing a direct structural effect 
analogous to A, already given, and the other showing an interactive 
effect the within area indiv idual slope is thus assumed to be a function 
of the composition of the areas This model ma> often be more realistic 
than the model w ith one parameter describing the contextual effect The 
lat^ should be, however, fitted first because it is simpler 
There are so many sources of errors in aggregate analysis that it is 
very difficult to make inferences correctly using just one type of analysis 
It thus seems necessary to use flexibly various methods and the partial 
controls given by the authors mentioned earlier New kinds of controls 
Should aUo be developed The areas can be, for example, divided into 
^bgroups. within which separate analyses arc made In addition to 
proximity, as proposed by Goodman and Boudon, other 
cn tena hke size and density of population seem relevant 
eeneral*miii?.«f"! *^°*ogtcai reascarch that our knowledge of the 
collecti\p< in^ analysisand of the special nature of areal units as 
research Methods and principles appropriate for ecological 

both the indiv H In studies with systematic data from 

~a ror example, the nature of 

problems airdSl r a* generality could be examined” The 
this land of research Hoa “ec, of course, much greater for 

other way of deselonitie "''’"We is worthwhile, because no 

eeolo^cal ana?;ts«rafeffS'"’ 

cir pp ^e’es*eal Correlalion," pp f/r,P 623 

nieasurcm*nis and aggregate da^' ^reommends the combination of individual 
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A Typology of 
TcoJogjcai F.ilLic/es 


Many YEARS have passed since W S Robinson slariled dismayed, and 
even inrunatcd man> users of ecological data svilh hjs demonstration 
and "proof ’ that statistical associations for aggregated populations 
might di/Ter m magnitude and c\en m sign from those for individual 
population members ' Now avearcmoreclearlyaivareofthemferential 
limitations and the conceptual advantages of ecological data, and al- 
ternative ways of using such data to infer individual behavior have 
become available But if tl is a measure of Robinson^s success that bis 
findings have now become methodologically commonplace or even 
passe, hfs origmal article »s still worth reading — largely because of 
h)s incisive and exemplary use of simple statistical tools objectively 
to Illuminate valuc-Iaden political discourse 
Starling with Robinson’s well known interpretations ofthecovariance 
theorem, can we find other inferential fallacies equally tempting to 
hasty researchers'’ Might not other statistical tautologies be similarly 
subject to methodologically and substantively relevant interpretations? 

It IS this path of generalizing on Robinson’s procedure that I should 
like to take, seeking also to follow up the more negative implications 


1 “Ecological CerrcIJlion*andlbeBcha’>>ororindi\iduaIs /4/nerican Socw/og/caf 
fieneu. /S(I950) pp 3SI-3J7 More recent reactions or cxiensjonsare discussed in 
LcoA Goodman Some Alternatives to Ecological Correlation ' American Journal 
ofSonohg) 64 0939 ) pp 610 tOS.O D Duncan R P Cuezort andB Duncan 
Suilisiicat Ceograph) Problems laAitaliZti^ Area! DarotCIcncoe FreePress t96l) 
and Raymond Boudon • Propritlis individuelfcs et propnit* collectives un pro 
bICmedanalyscccologKjuc HeiHe/tonraisedesociologie •/(1963) pp 275-299 
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of a typology of ecological falacies with some more positive thoughts 
on how to explore the very questions that we sometimes try too im- 
moderately to answer For if we really know that certain ecological 
associations are false or misleading or spurious, then logically we need 
only to rearrange our knowledge in order to discover the direction in 
which truth might be found 


SOME STATISTICAL PRELIMINARIES 

Mathematical reasoning can only add empty, tautological form to the 
content of our arguments Nevertheless, the clarity, the certainty, the 
abstract generality, and the potential for complexity of such a form of 
expression often lead to, or help con\ey, surprising and worthwhile 
discoveries 

In order to discuss a variety of potentially fallacious ecological 
inferences, 1 find it convenient to use several versions of the we!! knowm 
covariance theorem of elementary statistics From a mathematical point 
of view, once notational conventions are established, these theorems 
combine an elegance of form with a simplicity ordcrivalion Empirically, 
they suggest and help us to organize our thinking about a whole range 
or obvious and not soobvious relationships among statistical associa- 
tions defined at diffcrenl levels of analysis 

our discussion to variables, such as X, K 
^ Exhaustive subsets of these 

are a tnni^'^ /' ''arious disjoint "regions" r, of which there 

unii/in rcp*!! Each observation A", „ /i,, etc , on 

will be con^' d ^ j ‘ ^ assumed to occur at time /, usually observations 

acercpation ^"*1^ involved may already reflect considerable 

“individual ” N definitionally avoided calling a “unit" an 

sent X ’* frequent practice to de- 

collectivdy r * ^ individual / belonging to regional 

be defined Usmrtl!^'‘M '0**>^'duals, regions, or times can easily 
o notation to represent such averages, 

^3y z% individual,, the broken down mlo periods m ihe same 

' a* I me / in period „ be used lo indicate the A" value of onil J 

of I arguments by Karl Deutsch as 

^ and Duncan refrained from sovh an ebbora 

•'^proving the hovievcr for iheir efforts «i 

used throughout ihc prevent paper 
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can sumraarae formulas for J-,,, .V„ A’, , and A’ m rous Al through 
A6 of Table 1. 


Table / 

Statistical Elements of Covariance Theorems 




(Ai) 

Y,.t 


(A2)^„ 


^ The value of variable A' 

. x;*" 'a^n/=N (ory)aUimerrorumt 

^ fT Ibclongms to region r 

/ = I, 2, , / ) 


iSA,,, 

N, Ur 

Where JV, IS the number of units i in 
region r 


The average value of X 
at timer for all Vr units 
, belonging to region r 


(A3) X t 

s.V- 

(A4) A'l 

f 

(A5)A', 

_L sSA... 

TN , « 1 

(A6) AT 

-Lx X ATir. 
r/V(T. i.c 


The average value of \ 

at time r for all N units 

The average value of \ 
foralliimcsrofaparti- 

cular unit / 

The average value of A 
for all units I belonging 
to region rand ail limes 
t 

The average value of .V 

forallunits/and limes 


Formulas exactly parallel __ “ 

The universal variance 

. « ^ \ of A'and 1 forallumts 

i- S (A^‘" " * I (at timer) 

The universal covari- 

, r . rUY - Y) anceof XandYforall 

J- X X randalW 

TN iTi I * • 

The universal variance 
of \ fof 

i. V (A^.ri- /{attimer) 

(A similar csprcssion. 
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Table I — contmtieJ 


Symbol DefintUonat Formula 

Interpretation 

(B4) WCjrr t j s 

The within region co- 

jj 2 W . - - y„) 

variance of X and 1 

(at time /) 

(Similarly, with double sums, etc, we can define H'Ctr) 

(B5) C„, , , 

The covariance of X 

} „) 
l€» 

and Y within a panic 

ular region r (at time i) 

(B6)£C„, , „ 

The between region or 


* ecological covari 

ance of X and Y (at 

(Similarly, we may define, EC,r) 
m TCxr 1 r V 

lime /) 

The between times or 

jTJ 2 2 ix,-x KY.-Y ) 

‘ trend covariance or 

l-l 

A* and Y 

(18) 'XC,, , , „ 


jv 2 (Ar,,-Ar„)> 

variance of A' (at tim« 0 

(Similarly, with double sums and deviations about A' r define IVCxx) 

mecxx, , , 

yiX(x„-x,y 

The between regions or 

‘ ecological variance 

(Similarly define EC,*) 

of A^(at time /) 



(C1)E*„. 

ECxx. 


Cxx, 

(C2)ES,. 

ECrr, 


Crr, 

(Dl) 

c„, 


Vc„, 

(D2> H , 

■ »c„, 


V„c„, 


The correlation ratio of 
X and R (at time /) 

The correlation ratio of 
Y and R (at time i) 

The universal correla 
tion of X and Y (3t 
time /) 

The within region cor 
relation of X and Y (3* 
time t) 
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Table 1 — conunuci! 


Symbol 

Definitional Formuta 

Interpretation 

(D3) ERxr t 

ECxr, 

The bciwctn region or 


VeCr, , x'CCn , 

‘ ecological correla- 
tion of V and i (at 
timer) 

After lime-averaging, formulas Dl through D3 give /?ji, M Rxt, or ERxi 


To generate covariance theorems we need only to define and alge- 
braically to manipulate several simple tautologies Which tautologies 
we choose depends, of course, on the kinds of relationships «e arc 
interested in studying In pirticular, the expression 


or Its slightly more complex version. 

- AT ) =■ (\„. - X.,) + (Af.i " r) + (-'^i - X ) (2) 

gives us a suggestive time specific '‘decomposition" of the moment of 
,Vi,i in Icrnis of within region, across region, and universal deviations 
in A*’ 

An attractive property ot Equations I and 2 is that summing them 
over r gives a simple and meaningful result 

(X, - A' )“ ('i 

In mote familiar form, ignoring lime, Ihis becomes 

(V, -X)-(',-X,) + (X,- ') »") 


.Allhnuahmes, urine 

tinctions susgesieJ by CquaifOiK I * i X,, Y» Y, - A . and 

possibilities by o'"'"'"® „ ha% oscil ibe fina of ihcse expressions 

A-. - A ‘^^7w.«nX.aii5ns m Dcmocracic xouns Thus, 

in their invesufi-mon of cniiwt cl«i»n ocxwiiom 

ihc lauiology. somewhat liKe Eauaiwn 2 + 

an be used lo ivllne a Ltal'<*nL»n rr» .Ml Uw snbjcel lu a,nc. rnncipd 

componcnl anahsis ^ j 

The second icrm Jr. and J A 

conirols for boih indiMdualand « 1888 mS" Animcm riit'ilcal Sfirree 

Mcldrum Critical Etccnons *« Hi'""* 

^eiix*».3^(J960) rP 
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Agam we sec how an overall deviation can be decomposed into an 
individual one and a regional one * 

Covariances arc average products of the deviations in two variables 
about their means In various ways the preceding tautologies can be 
multiplied by an equivalent equation in J, averaged, and simplified 
to give a relationship among covariances of the expressions in paren 
theses Options include whether to average over / or r or both, and 
the choice of initial tautology (see expressions B1 through B9 in Table 
1) Simplifications of an algebraic and conceptual sort arc possible 
largely because so many of the parenthetical expressions have means 
of zero 

The first such covariance theorem we shall present derives from 


* In fact subtract ng Equation 3 from I and rearranging the result gives a more 
complex tautology 

'.oUurci;'.' - J ^ 7j„'L7, Sein'd " " ’’ <^l 

b KAT.) - (A* )) 
universal trend 

■" ">■> c«rr»'<>n 

specif c on« Th,. from universal regional and individual tim^ 

oSr.JSiTS ''rd“cr 

polii cal “ fascinating paper on the locus of 

data S terms of vanS Phenomena Donald Sioki has described such 

cITecis To derne h s model" national slate and (Congressional) distri^ 

constant expressions m ih 4 that summing the three signed 

-A-., vvhich alsrap^n parentheses m each square bracket g.v« 

these three constant or , 'be leftmost expression (Stokes wH* 

thc-T, onthclcnas-a' T^' «Pn»sions rf, D, C, respectively ) Expanding 
indicate that X . coualM it,- to the right of ihcequals sign wouW 

p/uj three new i meless ones!**"!*® *^' **’”* trend components already there 
of the trend expressions so ^ f Second one might want to rewntecaen 
and timeless terms Stokes parentheses contained both temporal 

effects as /I B and C, 'be resulting nai ona! slate anddstrct 

i> = {X,~X) 


natonal s^aic^ /, ^ B, i- C, 

effect dwl” Txzion^\ state district 

C cITect 

‘^f^tani terms 1 — 


mixed time-specific terms 
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Equation 1 s Mullipijrng corresponding expressions for (X„ - V.) 
and - )',) nod aieragmg over all units m all regions giics. 


~ i W„ -X,nY„-r.) = l I (.Y,„ - A-,,) {,•„ _ 

+ ^ i (A-„ -A- ,)(>•„ - y.) + 1 1 (y„, _ ,v„)(, „ _ 


+ 


^ ,7 


(A'rr-A' r..) 


Fortunately, multiplying out and then summing the resulting terms 
tells us that the last two terms on the right of the equals sign, or the 
“cross^levcl” covariances, cancel each other out (In showing this 
cancelation, it ts helpful co remember that the r subscripts refer to 
various specificsubsets of /.and that averages over all / arc equivalent to 
proportionally weighted combinations of "dotAr averages llntermsof 
the covariance notation and definitional formulas Bl. B4. and Dfi given 
in Table I, the remaining terms of the preceding equation indiwlc that 


Cxrt “ I + JIT I (6) 

Equation 6 shows how the covariance of A* and ) foriV individuals/ 
at a particular time / can be thought of as the sum of a Hithm-region 
covariance and a bclwccn'region. ecological one for the same time i 
Either from the preceding dentation or the dcfimtioml formula it 
should be clear that the ecological covariance is a populationAWcightcd 
average product of reciOHu/dcviations m X and I Similarly, the within* 
region covanance can be thought of asa popolation-wcighted sum over 
all regions of the covariances of individual .Vand > values within each 
of the regions 

A slightly more complex (and less famitiar) form of this covanance 
theorem comes from a similar treatment of equation 2 Let us first 
speculate about and then state the results Taking expressions for 
— A' ) and - > ) and averaging them over /xJ/A the individ- 
ual units / and times / should give something like Equation 6 Perhaps it 
will be cxlcndcti to include another term related to the ( V, - V ) type 
of expression that makes Equation 2 more complex than Equation I 


* Since the i sub<«r.pi in (fie Covartaocc Theorem of Ccuaijfw 6 enie^ iis denvs 
lion in onlv a irw.al >*ay. ii shouM heefcar itat iwneWlicr Cuoai^n 3 or 1/ unh 
any suM/vtsfon* otr giv« a muU foewoHy simibr lo Eqoancn 6 
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Wc may call inferences from higher levels to lower levels fallacies of 
• decomposition” or “disaggregation ” The practice of assuming be- 
tween region (ecological) correlations to equal mdividual-level correla- 
tions stigmatizcdbyRobinson.wouldbeanobviouscxample Similarly, 

inferences incorrectly increasing their level of analysis might be labeled 
fallacies of “aggregation ” Tallaeies of inference from one subpopula- 
tion to another at the same level of analysis remain the third general 
possibility 


The Ciological Fallacy 

In particular. Equation 6, or its correlational form Equation 9, shows 
why Robinson could find ecological correlations so dilTcrent from uni- 
versal individual ones The effects of the regional grouping variable R 
and the within-rcgion covariations both interfere with this relationship 
That IS why a small Negro-ilhtcracy individual correlation of 20 
can be magnified by spurious ecological effects to a fantastic 95 
magnitude when aggregated data for a small number of regions arc 
analyzed 


The Indnuhialisnc Fallacy 

Let us reverse the direction of Robinson’s concern Do people ever 
confuse relationships among individuals with those governing aggre- 
gates? Certainly they do, as when ideologically motivated social 
scientists try to generalize from individual behavior to collective re- 
lationships Should wc believe, for example, that because laborers tend 
to be socially radical, there will be an equally high tendency to express 
radical opinions m those economically advanced states with high pro- 
portions of laborers in their populations*’ Or do laborers m the more 
economically advanced states become politically conservative’’ Whether 
or not stales should be treated as mere aggregations of individual units, 
correlations between economic indebtedness and political power, to 
cite another example, may well be very different for an international 
universe of citizens than for a congcric of nation states within which 
citizens arc indebted to each other 

Using Figure 1, we can see how both these fallacies are aggregative 
ut isaggrcgjtivc attempts to equate universal and ecological covari- 
anccs Further patterns of plausible but logically unproved inference 
may illustrated if wc decompose wilhin-rcgion covarnnee into 
population-wcightcd covariances within particular regions (denoted in 
formula US of Table I as C„,) Expanding the (I'C,, term of Equation 
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u" snra'’" ^ 'trms (r = 


ll'C„ 


A 

r 


V V’ " -^'rfX J'lrf “ } ».) 
r^. A' 




When parf(culA-ir weighting coemcicnls N, proportions, have been 
omitted in ngure I a special addition sign, ©, has been used Note also 
that time-specific subscripts have again been removed 



ngure J 

A TyTOI-OOY Of ECOIOOICAL pALUCItS 

The basis of this figure is Equations 6 and 6a A shghily ditTcrcnl version of ihis 
figure fs given and discussed m my A faf/iemanct am/ Mitics (Sew York Wactni/fan 
1965) pp t02 105 


Cross Le\e( FaKams 

Perhaps the most obnoxious fallacies of aggregation have been false 
generalizations from individual rchiions within a single region to a 
universe of intercollectivity relationships The most striking example 
in the eyes of some, in this regard, concerns giving to states the attitudes, 
the responsibilities, and the allegiances due to individuals — not all 
individuals, but some of them within a particular state 
After critically discussing work by Kaplan and Deulsch and other 
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advocates of a general systems approach, Harold and Margaret Sprout 
have recently argued as follows. 

the issue remains whether one derives clearer and richer insight into the 
operations of political organizations by endowing them even metaphorically 
with pseudo-biological structures and pseudo-psychological funclions . 

[The use of such psychoecological terms] tends Jo retfy abstractions and 
to divert attention from the human persons who ultimately decide what 
IS to be undertaken In the case of the slate, reification buttresses ethnocen- 
tric values and images which characterize so much of the discussion of 
foreign policy, military defense, and international politics m general 

Cross-level inferences raise a problem for any general systems theory 
For even if all “regions of data” obey the same set of relationships, 
the schema of Figure I shows how, even m a simple logical fashion, 
aggregated, higher-order systems may evidence different associations 
The most enduring debates along these lines continue to be about the 
similarities and dilTercnccs of individual, community, and national 
models of social and political systems 

The Unnersal Fallacy and the Selecine Fallacy 
Closely related to the problem of making valid cross-level inferences 
are the problems of generalizing relationships from subsamples and of 
representing in one particular subsample a universally true relationship 
If the subsamples are randomly extracted, only ordinary problems of 
statistical inference exist, but nonrandomly selected subgroups raise 
the likelihood that other variables, perhaps associated with the parti- 
tioning relationship, destroy any such easy correspondences 


® Harold and Margaret Sprout, TIk Ecological PerspecH\e on Human Affairs 
\uili Special Reference to Inlernational Poliiics (Princeion Princeton University 
Press, 1965) pp 20S-210 

While diisaivjCMtmsmyseU tswn\Vfts\tw\Vvi\VruTnanpsycho)ogitsi\ and biological 
metaphors do not add rich insights into the behavior of ‘ healihy* or ‘‘pathological” 
states. I do accept the charge that humanly described states appicar to be beyond 
the control of individual human actors as a serious problem But il is important to 
distinguish the analytical problem of correctly discovering relationships among 
states and connecting these relationships to those among individuals from the nor- 
malise questions of the obedience and responsibility due to reified and personalized 
colicctisc authorities 


The ^ct that associations at two levels arc, m general, unequal sshen the high- 
lesel sariabte is an aggregate of the lower one suggests but does not prove that simitar 
generalizations might hold for a large number of structural variables that are not 
mere aggregations The fact that some of these examples do refer to such cases 
wggcsis that these mfercnccsarc equally difficult Similarities or differences may well 
w cio«iy related to problems of context, to be discussed shortly Erwin Schcuch 
has helpfully emphasized these points m commenting on my paper 
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increasing incomes in the “culture of poscrij may also be a con- 
teatually fallacious assumption Although the examples become hartler 
to come by, one can extend such a list almost indcrmitcly 


Vie Cross-Sccitonal and Lon^iiuciinal rallacics 

As Duncan and his associates ha\c done, s^c can interpret Equations 
6 and 9 as if temporal “regions” were >cafs, and we can generate time 
senes correlations from the ecological terms ’ ^ Then we could interpret 
the cross Icscl links as indicating the unequal relationship between 
cross sectional and longitudinal covariations Even though the in- 
adequacy in most eases of inferring instantaneous relationships from 
longitudinal data or vice versa is well known, the temptation is almost 
irresistible The peculiar advantage of Equations 6 and 7 is that they 
show quite precisely why and to what extent such inferences arc valid 
or invalid 

Rather than present a number of examples, 1 should like briefly to 
discuss Equation 7's expansion of the regional and temporal covariance 
into within region ecological and trend covariances, each of which can 
be further broken down into withm-ycar and or withm-rcgion terms '* 
The great merit of this expansion is that it shows simultaneously how 
all sorts of fallacious simultaneous cross-level or within-lcvcl inferences 
can be further confused when longitudinal data is considered 

The simplest form of mechanistic assumption we can make about 
human behavior is that it obeys linear additive determinative relation- 
ships — m fact, we assume this, at least approximately, every lime we 
calculate a product moment correlation coefficient and impute to it 
causal significance Much more closely corresponding to the insights 


Duncan et op cn , Chapter 3, and ' Resionatiim >cr%us Uni%crsalism.’* 
op ni Thc« sources give empincat evidence on when and v-here cross-sectional 
and longitudinal covariances have and have not corresponded 

“ An analytical technique making use of these kinds of problems and taking 
account of lAithin region and bet^veen region di ffe renc e s is Gibsons continuous 
'o ' c, Lazanfeld s “Utent Class Analysis “ which he calls * Latent 

Profile Analysis The algebraic lauiologies from which model equations are derived 
arc not unlike those given earlier Their distinaive feature however, is the use of 
triple products of the form 


y)(z, z) 

^ “Regions" arc constructed as subgroups 

Tnlf N If ^ product covariances disappear See P Uzarsfeld 

structure Analysis to Quantitative 
Ec^ralDaU inF Massar^ndP Raiooshfcds) Maih^tical ExphraUans 
in BehoMoral Science (Homewood III In*m Press. 1965) 
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Of biological and cybcrnciically oncnfafed theorists is the idea of 
contextually contingent dcicrminativc relationships, which themselves 
ma> e\oKe or devolve through time as the result of certain feedback 
processes For example, explicitly temporal models allowing for region- 
ally or contextually nonadditive effects help us belter to model the often 
delayed impact of fluid soent structures on the class-to-part} or ethnic- 
to-party voting relationships that we sometimes assoente with V. O 
Kej’s concepts of a critical election tf the most imcrcsling soci d and 
political phenomena are those in which the rules change from context 
to context, then statistical formulations that do not allow for such 
processes arc painfully irrelevant 


FROAf ECOLOGICAL FALLACIES TO CROSS-LEVEL 
HYPOTHESES 

It may seem piradoxical to say that the discoiery of error is the 
beginning of wisdom, but this is. in one sense, an implication of Herbert 
Simon’s brilliant .ilgebraic treatment of causal inference problems in 
“Spurious Correlation A Causal Imerpreiation Jn a similar fashion, 
docs not Robinson’s demonstration of the existence of a “fallacious” 
ecological correlation point toward the discovery of the true pattern of 
ecological relationships’^ 

Recall Robinson’s examples Individually, Negroes showed only a 
small tendency {Rxr = 20) to be more liliteraie than whites, eco- 
logically, however, the tendency appeared extremely strong {ER^j 
95} The result provides almost unavoidable political ammunition for 
segregationist arguments — unless the ecological fallacy argument can 
be made credible Less likely to appear in campaign oratory is the 
pro-immigration .irgumcnt that foreign-born Americans are much less 
likely to be illiterate than native-born {ERxr *= 02) , but even this, too, 
IS an ecological fallacy {.Rxv ~ 12) 

Let us speculate about these results, especially the politically loaded 
one Our liberal sensibilities make us feel that, even for 1930 data, the 
ecological Ncgro-illitcracy correlation is a false one Moreover, some 
knowledge about the different areas of the United States of the 1930 s 
in which Negroes and illiteracy were concentrated, those that were least 


Repnmed m Models of Mon (New York Wiley, 1957) irrtpl.eat.ons of 
Simon’s insighls ha^c been more fully broughl oui m Hubert Jr Couwf 

hMcnces w Honesper, mental Research (Chapel Hill UniwRily of Norih Car^ma 
Press 1964). and A Ando 1 M Fisher, and Smon, Essays onthe Structure 
of Social Science Models {CAmtsnd&s MIT Press, 1963) 
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urban^ed and mdustnalized, suggests that the ecological pouping 
variable probably hides the causal effects of urbanization, bringing 
about le .5 illiteracy and the tendency toward the exclusion (in Robin- 
son's prewar data) of Negroes from industnal opportunities If there 
IS a real but small correlation of Negro race and illiteracy, as 
Robinson obsened. we might say that even this small association, 
when all effects of ecological groupings are taken into account, is 
spurious Thus we should expect that WRxy, for small and homo- 
geneous enough regons, would beesen closer to zero than Rir for 
individuals’ 

Using the correlational sersion of the covariance theorem, the argu- 
ment might go lihe this 

20 = V\^E\b Vi - + 95 Etb 

For the reasons given, v.e should expect much — say half — of the 
variance in X (being Negro) and Y (being illiterate) to be ecologically 
determined, that is, ECzxfCzx and ^CryfCrr would be about 5 But 
then the preceding equation Iwcories 

20 = 75 WJtxT + 24 

from which it appears that IVRxr *= 05 « 0 (In fact, for the four 
macroTcgtons that Robmson does publish within region data, the aver- 
age within region correlation is considerably smaller than 20) Now 
what docs it mean that *5 approximately zero‘d From our 

previous formulations, especially Equation 6a, we hnow that we can 
think of \yJixj as an average of wjthm-rcgion correlations Also, we 
hav e argued that the nominal regional grouping variable R acted some- 
thing like industrialization and/or urbanization in its decreasing effect 
on iffiteracy and something like traditional ruralism in its depressant 
effect on Negro cducaiiona) aspirations and opportunities Therefore, 
treating the regional distinction /? as an aggregate interval scale index 
of industrialization and urban opportunity leads us to interpret lYRxr 
Si 0 as meaning that R somehow accounts for, averages over, or 
“partials out” whatever real correlation there is between X and Y 
That IS, in partial correlational language, 

Rxr B S: 0 

This result is, in turn, Simonas rausal prediction equation associated 
with the cross-level h>’pothesis of a spurious causal relationship 
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between -T (Negro race) and K(iHitcracy) controUing for (industna! 
opportunity)” 



In summary, we have expanded the ecological fallacy argument into 
two stages to show bow ts totaffy misleading, less so, and 
least of all The first stage, reducing the significant correlation 
from 95 to 20, was the one suggested by Robinson, tlie second stage, 
reducing it further to about 05, seems to follow from and complete the 
intentions of bis original data analysis and methodological orientation 
But, m fact, it derives from what has come to be called the “Simon- 
BlalocK causa! modeling'* tradition ** 

One begins to wonder about the extent to which the literature of 
ecological analysis might develop along different lines if all investigators 
were always aware of both kinds of fallacies — those of aggregation 
and those of spurious correlation If we were to continue to develop 
this line of reasoning, we might possibly benefit from our previous 
discussion m two significant ways 

First of all, modeling crossdeve! processes would more naturally be 
assumed to be of significant theoretical interest When, for example, 
do national and/or regional factors dominate election behavior? And 
when do tliey reflect, via aggregation, the summation of local and indi- 
vidual decisions'’ Variance components models such as Equation 4, 

5, or 8 help us to begin to locate elTective political decision making 
(unfortunately, we cannot empirically choose among these models, at 
least until additional variables are introduced) Causal modeling with 
dilTerenl levels of expression, such as (ATj — ) and (K,„ — K,,), 

should help us move further m this direction, and it should benefit 
from a knowledge of the logical relationships that might exist among 
such terms 

But the preceding analysis, particularly of contextual differences and 
efiects, leads me to expect that nonlinear causal models will be necessary 

See, for example, Blalock, op eu , Chapter 5, where in making causal mferenecs 
he uses ^gher levels of aggregaiion lo cancel out processes lhai are random teiMfcn 
regions I have summarized and applied son» of the relevani Iiicrature on recureivT 
and nonrecursive causal models m "Oiusal Inference and Political Analysis’” in 
Joseph Bcrnd(ed ), Maihemarical Applteaims in Fohucal Satnet {DiWis Souihcm 
Methodist Univenily Press 1966) 
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,f we are fully to take into account the nch variety of etl«ts due to 
geopolitical contert and the socioecological environment This second 
implication of this reanalysis also remains to be explored 


/p* wntten, R_ Botuion. L’analyse mathfmatique det fails sociaux 

(Pam Plon. 1967). has made a number of impressive coniributiom along these 
lines. S« also A. L. Stindscombe. Coiujruamg Social Theories (New York 
Harcourt, Brace & World, 1968) 
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Individual Behavior and 
Collective Properties 


Thejivt paptti pmtMtd m Pan Two foon on o etuclo! qatnion of research 
strategy the optimal stratifieJ sampie st/n^s of the national popu’ 

htion anil ecological analyses of data for o iriJe range of territorial units i*liklii 
each country. 

The typical sample surrey eanctntrates or* tndiriJual-letel ranabks lueh as age, 
sex, education, denomination, occupation and adds tety httle Information about 
the social and cultural cniircnmcai ulthin uhlch each indiridual pursues bis 
acthltlcs. Bui it It u’cll knoun that the hchauorofan indirtdual is rarely a direct 
funetion of his abstracc position hut it hearlly eonditwneJ by his immediate social 
context If makes a great deal of difference to the cultural or the pohucal 6<fio» ior 
ff on indmdual wbahtr be or hs household is in a minority or a mojoniy position 
ivit^n the giren social context, the Catholic tends to behaxe In one h(^ fn a pre- 
dominantly Protestant in anoiAff Jn a thoroughly secularized enrironment, 

and ogam In another in a komogeneous coaiext if like-minded people, itmlfoiff, 
on tffee employee's behanor is guise different in a x-orking-elass district from his 
behavior In an upper-middle-eloss area. 

The papers by ]uan Lmz, Emm Seheoeh, Kevin Cor, ycon Laponce, and Dand 
Segal and Marshall Miyer ell offer a variety of trampla of the Infuence of the 
ecological context on the behavior of Individuals and, m foci, eonstiiule a pewttful 
enttgue of the "atomizing" model underlying the standard natiomeiJc surrey. 

This, of course, does not imply any rejection cf the sample survey as a technlgue 
of data gathering. To the contrary, the papers ell call for surrys but in a new 
sampling frame; thy call for ccok^icaJljr coniroDed surveys Innead of 
••otomittic" ones. In his rich and suggestive paper Juan llnz sorts out with 
admitoble claiity the pros and cons af the twa mmpetlng methods- the nation* iJe 

S7 
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anJ ,he ^ ^ '"Z ‘''I “ 

fina/ vercf,«/or or against any one of these but argues persuasively for the joint 
use of the two procedures for the organuation of sursf^s within a wide range oj 
culturally, socially, andjor politically distinct ecological contexts 

The papers by Cox and by Segal and Miyer gger examples of ecological controls 
in survey analysis Cox shows how information about the majoriiy-minorli/ balance 
in each precinct adds a decline dimension to the analysis of thefow of political 
communication Segal and Myerdemonsiratctbe importance of ecological variables 
in the reanalysts of survey data 

There has been much emphasis daring the last ten to ff teen years on the dangers 
of the ecological fallacy l/ie papers by Alker, Allot Jt, and 1 alkonen in Part One 
remind us fore fully of this ujue But it it time that we react uifft equal Insistence 
a^oinit tfic opposite error, the individualist fallacy, the tenden^ to generalue 
fndingsfor individual behavior without controlling for the c/iarflrtfrijfj« <f the 
immediate social, cultural, or political eoniextt Thcfwe contributors to Part Two 
all agree on tftif point they all reject the assumption of the ‘individualist surtry 
£nvin Sc/](uc/i hot perhaps gone further than anyone else in this direction, his 
initial denunciation of the individualist fallaiy in his contribution to the volume 
Comparing Nations has been widely gaoled as a sign of a change in the trend 
of research In his paper in this volume he formulates his position wfilh great sue- 
einetness “In ordinary surv^ research the individual is usually treated, albeit 
by imp/ictjiion, as an atom Just adding atoms does of course not make for mol* 
ecula " 

Jean Laponce adds a further inJKtmrnt against the indiscriminate nationwide 
suTvy he underscores the dangers of handling national averages for countries 
marked by strong r<^iona/ contrasts In countries such as Canada, France, Italy, 
Spam, India, or Mexico, any attempt to draw conc/ujionijrom nationwide sampling 
ivi// be highly hazardous without built in control; for regionally jpcci^c contexts 
In these countries the correlations between sociocultural characteristics and political 
vorJobJe; tend to wash out within the total ooiiotviZ Dimple Tb* cccielctwnt come 
out dijferently for each region because the »xiocuhural characteristics and the 
poJiticoI rcjtrirtionj both take on d^erent concrete meanings according to the 
geographical, and therefore btstancal, context It does not mean the same politieally 
to be a peasant or a worker in Tuscany as In Calabria, in Andalusia as in Catalonia, 
in the Pas de Calais as jn the Limoiain In France there are, in fact, cases where the 
„ilh,n „ ryicn ore mailer than the Jtfferettcet 
among regiont Jot the lawe taclaeitllaral eategaiy Thtt, afeaurte, daet not itiean 
t at tocloeidluial dtjjerencet do not count »i(hin each region the categonct timply 
tote on diftrent ireonin ^; from context to context 

The need fit eonlexiaally tnnlrolled tnrrcyi ttndt 10 he panicuinrif urgent 
nheieter the pnaatj lector mil vreight hennly in the balance of political fereer — 
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in fatty in aU tounirks v/ith tnaticJlj uaJerJt*eIopai reliant The tame gaet far 
countries ivitft a fiistoy' of teiumiat cufnira/ oppotiuont SfonJorJ nafionrt’lJc 
sur\c)t of the United Stoics tenJeJ to treat the South at if it irere just dm«Aer 
region, the recent attem^ifs to otersamp/e the South and to strati^ ^ the local 
hlegro h hue balance testify to the Inadi quoiy of tbit procedure 

For all these reasons direct comparisons <f nationwide surrey data u III tend to lead 
one astray h many cases the better strategy trtU prate to be in the Jjrecfson of 
cross regional surv^s across scicral couiMrics— survys conducted In rejions or 
Similar leiels of dcrelopment in several toantnes, far example, Tuscarg In Italy, 
Andalusia In Spain arid the Limousin in France, or the Fjnnmark In Uoriia^, the 
Norr/onrf in Sweden, and the LapplonJ in Finland 

In sum, this section if our tolumc pleads for ecological surreys against m 
dividualist surveys, for cross regional analyses againrt cross national The 
survey ecology polarity seerns in us profoundly false IJ bat is nee JeJ is a strategy 
of ^stemotic linkage a Jehbetate e^rt to merge information for reieral levels of 
larialion 
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Ecological Analysis and 
Survey Research 


INTRODVCTiON 

If we turn to the history of research in diverse fields of sociolosical 
inquiry and to the emphases of the difTcrcnt centers of learning, we 
discover that ecological analysis and survey research have developed 
separately, even when they have studied fairly similar if not idcnijca! 
problems Sociological ecology has a longer history than the sample 
survey But the dominance of the latter type of research m recent limes, 
particularly in the United States, might give the erroneous impression 
that It would ultimately displace ecological analysis The argument of 
this paper is that both methods have advantages and disadvantages for 
the study of certain problems and that in many cases the two could 
complement each other and 6e combined in a rt-uifrui" way rheorcficaf 
and practical considerations make such a marriage of methods desirable 
But — as vve shall see at length — such a combination of methodologies, 
with their traditions and distinctive requirements, is not without strain 
An awareness of the difficulties may contribute toward preventing a 
quick divorce affer a hasty and unpreincdilafed union based on a rosy 
vision of the future 
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The scarcity of studies combining both approaches’ can be attri- 
buted not only to the separate intellectual traditions but to the ‘•f* 
at an early stage of our knowledge each of those methods could by 
Itself contribute much to our findings To mention one example, any 
reader of the resiews of research in the field of political behavior is well 
aware that the same propositions or generalizations are documented 
b> reference to both sur\e> and ecolo^ca! studies ' Andre Siegfried 
(1913), followed R Heberle and the earJ> more quantitati\e studies 
of W F Ogburn, Stuart Rice (1924, 1928), and their students at 

Columbia H Gosnell at Chicago, Samuel Eldersseld, and particularly 

V 0 Key arc quoted simultaneously vsith the sui^e> studies of the 
Bureau of Applied Social Research by P F Lazarsfeld and his col 
laborators, and those of Angus Campbell and his group at the Sur%ey 
Research Center of the Unt\ersit> of Michigan Also cited are the 
findings of European survey research organizations such as IFOP, 
DOXA, DI\0, the Instilut fur Demoskopie, ei al In fact, scholars like 
V O Kc> turned from ecological to sur%c> data m the course of their 
ov.n V ork ’ However, no major work published to date (except some 
work in progress at the Survc) Research Center and some articles of 
the regional panel project at the Bureau of Applied Social Research) 
has combined both ecological analjsis and survey research system 
aticallj in the same study * 


‘ Anong the excepuons may mention Henry Valen and Daniel Katz, 
Poliiieal Parties in hor^a^ A Cammunty Study {Oi\o Universiietsforlagel 1964) 
a study that combines attention to the ecological comexl with the analysis of survey 
data Stem Rokkan and Heno \alcn. Regional Contrasts in Norwegian Politics 
A Rrnr# of Data from OlTKiaiSfaitsttcs and from Sample Surveys ** in Enk AHardt 
and Yrjo Litiunen (eds ) Cleorages ideotogies and Party Sistems Contributions to 
Corr-vraii e Pol tical Sociology UichmU Westcrmarck Society 10 19M) pp 162 
23? tctespcciallv pp 187-20) The class fication of communities by the availability 

of the part) press serves as contest for the analvsisofsoting behavior m S Rokkan 
and Per Tarvvik “Dcr Wahkr der Lever und d c Parteipresse Kolner Zeiischn/l 
fur So wl getmdSo lalpsychohg e /2(1960) pp 278 ff Norman M Bradbumand 
David Caplovilz, Reports on Happiness A Pilot Study o/Dehanor Related to Mental 
Health (Chicago Aldme 1965) pp I 5 selected as the contest for their survey 
commin ties difTercnil) affected economic depression 
* Paul F Lazarsfeld. S M Upset Allen Barton and J Unz, “The Psychology 
ofNotng An Anabsis of Political Behavior'" in G Lindzey (ed ) Handbook of 
Social Psychology 2 (Cambridge Addison WesI^ 1954) pp 1124-1170 
in/.\ ^ f Optn on and Ameriean Democracy (Sev. York Knopf 

1^1) after hiv monunvnial SwarArrir Po/r/fj (New York Knopf 1949) 

y c and the Representative S)stem of Govern 

^ in AUardt and Lmunen op e/r, pp MJ 376 relates data obtained by inter 
® sample of the eleaorate in their constituenaes with the 
r>r V i into account the safe or marginal character 

l d vtfKt dfterm n-d on the bas s of voting statistics The studies of the Survey 
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The main questions \ve can raise arc 

1 Are !he ti.o methods mlerchingeaWe m their results, and ,r so 
M h> should we prcrcr one to the olher*^ 

2 If the findtnes are not always mtcrchangcabJc, which would be the 
intellectual rather than the strictly pracljcal advmnges and disad 
vantages of each’’ 

3 Granted that Ihcj arc not miercbangcableand bothhuedistinclive 
contributions to make »s there any particular advantagemcombinmg 
their use in a single research design 

4 It one assumes valid theoreticaland practical reasonsfofcombmmg 
the two approaches the mfrcqucncy of this combination suggests that 
there must be certain difficulties involved What are those difficulties, 
and how could we attempt to circumvent them’ 


SITUATIONS FA\ ORING ECOLOGICAL APPROACHES 

Certainly there are problems for which both methods can be used 
and in which the choice will depend largely on strictly practical con* 
siderations and/or the personal preferences of the researcher But in 
other cases one or the other method wilt have distinct advantages and 
will impose Itself Of course, to estimate the political preferences of 
broad strata of society — such as the working class Catholics in Ger 
many (see the cxceltent early study by Johannes Schault)* — both 


Reicarcli Cenler of ihc Vnncnity of NJich gan have on occasion labulaied dala 
from surveys using charactcnsucs of the county or communuy as a confexi fbr 
CKumpk when ihey fabulafe awarerwss of cand date mrormaiicm by Ihe fad ihai 
Ihc incumbcm lives in the same commumiy ((caving as de large urban centers) asm 
Donald C StoKes and Warren E Miller Parly Governmenl and Ifte SaJiency of 
Congress m Angus Campbell Phlip C Onvmc Warren E Miller and Donald 
E Stokes Deetfont artJ i/ie Pffiir/ca/ OrdtTfNfwYork Wiley 1966) pp 20S-209 
or when in studying the Economic Aniccedenis of Pol Ucal Behavior they relate 
the pcfcentagc of unemployed m a covntry lo Ihe atntudes toward governmcnial 
guarantees of full employment as in A Campbell tl at Thf Amentan I o(er (New 
York Wtley 1960) pp 383 384 See also Philip H Ennis The Conicxiual D men 
sionin Voting inWilliamN McFhec and William A Glaserfedsl Public Op man 
and Consresshna! Ekcthnf {G\cnco(! Free Press. 1962). pp 180 2J1 for a more 
extended version see his unpublished Columbia University Ph D dissertaiion in 
sociology 1961 , _ 

’Johannes SchaufT Die dcutschea hatholiUn and die Zentrumsparm Line 
pi3liUscbst(iii^iischeUnierJiielMngdcr}{ejclsiags>nihhn’:eitlB7l{y-.o\n J P Bachem 
J928) ll should be noted that SchaufT in the absence of survey daia wanting to 
know how many Catholics voted 2cnirum developed a mcthodoloiO' very simily 
10 the one used by Mattel Dogan (without knowledge of this work) in his ariicle 
Lc vote oavrier eit France analyse icologiqoe des ctcciions de 1962,” fiewie 
Franpaife de SocioloBie 6(1965) pp 435-472 
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approaches can and have been used In the ease of more specific 
population groups like the “occupational communities” of nimcrs, 
fishermen, etc , living in distinctive communities, or even neighborhoods 
(such as dock workers) ecological data offer a more economical way to 
obtain the same information National sample surveys generally would 
have too few cases in such numerically small but sociologically and 
politically interesting groups However, sample surveys among such 
groups, in such constituencies, once identified by ecological research, 
would have the advantage of allowing us to inquire into the motivations 
of the voters and to use other “intervening variables” in the analysis 
It would allow us to use the panel method to study turnover and to 
check the “ecological fallacy” in its crudest form, as pointed out by 
Robinson (even when survey analysis has its homologous fallacies) 

The impossibility of using survey techniques, owing to the lack of 
organization of or freedom to use them or a high rate of refusals or 
nonresponse on certain questions (such as on Communist voting in 
France and Italy) may force us back to using ecological data (Sec, for 
example, the ecological studies of leftist voting in France and Italy, by 
Mattel Dogan )• Also in the ease of numerically small social groups 
living in distinct areas, ecological data would allow us to reach generali- 
zations that numerically extensive sampling would make too costly 
In an exploratory and descriptive study, an ecological analysis can 
also be extremely fruitful m pomling to the variables that might be 
important in an explanatory analysis using survey techniques that the 
researcher unfamiliar with the phenomenon he intends to study may 
not have considered To profit from this advantage of ecological ap- 
proaches attention should be centered not only on the correlations but 
also on the clusters of deviant cases we arc likely to find in a scatter 
diagram of the relationship between two variables An analysis of the 
characteristics of those units — departments, cities, etc — for which 
the over all relation docs not hold can be helpful in locating additional 
explanatory variables Often the hypothesis derived m this way from 
the ecological analysis will be testable only through survey data 


^ Lipvcl. Affl/i (Garden Cily Doublcday, 

iyw) passm 

mlh.sMmcxolum= 

socialedci sufTragi m Iialia m A Spreafico and J LaPalombara (cds ) Elezioni e 
Comun.ta 1963) pp 407-474 See also 
'"1- Hamon(cd) Us nou^eeux 

bS; pT m 
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Ccolog) and the Stutf} o/ RuraJ Society 
The characteristic dispersion of relative])' small numbers of people m 
rural societies and the wide variety of social characteristics over many 
easily identifiable and often fairly homogeneous administrative (and 
therefore statistical) units make the ecological approach particularly 
useful in the study of rural society In fact, only with extremely large 
samples (without previous ecological information) and in only a few 
homogeneous societies would it be possible to represent and identify 
adequately all the types of rural social structure and class relations 
The dimensions that have to betaken into account in the study of rural 
society are so numerous, their combinations are so complex, while 
their geographic distribution is so far from random that a normal 
survey may miss many of them Dimensions like patterns of (and 
tenure (property, tenancy, and other agrarian contracts), settlement 
patterns, tjpes of crops and their rotation, agricultural density, mcN 
dence of economic crises due to climate or market conditions permit 
many combinations that determine differences in altitudes and political 
behavior The vvc.iUh of statistical data and maps for al) these indicators 
for small, relatively unpopulous administrative units permits us to 
identify a great number of such types in a way that the data from a 
national sample survey, without many sampling points and a very large 
sample, never would permit ^ 

Furthermore, m contrast to the urban social structure, m a rural 
community the various strata and social groups are visible to each other, 
therefore the particular mix of systems of land tenure, etc . is much 
more likely to have a "compositional effect”’'’ on people who occupy 
comparable, if not identical, socioeconomic positions m villages of very 
different social structure The aggregation in survey analysis of such 


’ToIkcDovnng Land and Labor in Carapf. 1900-1950 A Comparatht Suree) of 
Retent Agrarian iJisiorjr (The Hague Marttnu* NyholT 1956). Wilbert E Moore 
Rconaniic DemagraphrafEauern and Sontfiern Europe (Geneva League of Nations 
1945) Pitirim Sorokin and Carle C Zimmerman Fnncip/esof Rural VrtKmSo<w/ogy 
(New York floli. J929). Arthur L Slinchcombe. ‘Agricultural Enterprise and 
Rural Chss Relitions," A/mr/fw/Jaurmi a/ Socurlac}. 67, 2 (196]) pp J65-176 
■« Patricia Kendall and Paul F LaraRfetd * Problems of Survey Analysis,’ in 
Robert K Merton and Paul F Lazarsfcld (eds ) Coni, milder in Social Research 
Studies In the Scope and Ueihad af The Amerkan Soldier (Glencoe Free Press 
1950) pp 187-196 PaulF Laiarsfeld ' FlroWems m Methodology m Robert K 
Merton, Leonard Dcoom and Leonards Cottrell Jr (eds ) Sociology Today (New 
York Basic Books. 1959) pp 39 7S sceespeoallypp 6^73 The same ideas were 
developed further in P F Uzatsfcid and H MciucI, On the Relation Beincen 
Ind.vidu3landColIcciJvePropcrl.es.' .n AmiUiEuion. (cd ). Complex Organrallons 
(New York Holt. Rinehart & W.nslon. mi), pp 422-440 Among ihe articles on 
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people without reference to the commumtj context ma> be thoroughly 
fallacious For example, in Spam a poor peasant otsner in a small 
Galician hamlet finds himself in a completely different context from 
a man classified in the same way Ining in a large Andalusian sillage 
dominalcd by large landoaners and with large numbers of farm labor- 
ers “ The social context of the village becomes a prime determinant of 
the occupants’ different political and social behavior In a national 
sample survc> it seems improbable that the data should permit easy 
identification of such social structural contexts, s^hilc they arc obvious 
to anyone at all familiar with the wealth of available ecological data 
In this perspective it is not surprising that in the past, national 
censuses m semideveloped or underdeveloped countries have been less 
satisfactory for describing the rural than the urban social structure 
However, the combination of census data with the data of agricultural 
statistics, land registers, tax records, historical data on property rela- 
tions, etc , facilitates the study of rural society in a way that national 
surveys would not It is no accident that ecological research has con- 
tributed much more to our knowledge of rural politics and social 
structure than has survey research Certainly it cannot exhaust the 
topic, but anyone attempting to use survey research in this area would 
be ill advised if he were to start his study without an cfTorl first to 
consider all available ecological data and to design his survey taking 
them into account 

Wc must add practical reasons to those derived from the nature of 
rural society based on the diversity of soils, climate, and the history of 
rural property relations, which are less subject to evolutional change 


methodological problem? la the study of * sinidural'* or * composilionaP efTecls, 
see Peter M Blau “Stniaural ElTcas.' Amencan Sociological Revie*, 75 (I960) 
pp 178-193 James A I>avj$,JoeUSpaeih and Carolyn Huson, ‘ A Technique for 
AM^ng of Group CbmposmoT,** Amencan Sociological Reiter, 26 

(1961) pp 215-^ Arnold S Tannenbaura and Jerald G Bachman. **Slruciura! 

vepus Individual Effects.'* 59(1964) pp 585-595. 

^11 so'tK points »n the previously mentioned article 

llmona de hi agitacionei campesmas andaluzoj — Cor- 
^(Madrid dc D-recho Pm-ado. 1929) fora description of such soaal 

stnwurcs and their political implicaticms 

Agfioillgral Basis of Enioiional Nalionalism." Fublic 

OpmmQmrlnly.Irnyi) p 50 IT. Rudolf Haberfc. 

0™'>vheVc,la£vu„u,ll 1963) S« also 
Ite °°re°" J Frame The Femamry in 

rh,to SI™ "r UmvuTs.ty Pru,, I9M), pp 258-259 

Va!:, S"ra4^;.Tp 
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,n US analysis the behavior of the anonymous masses rather han to 
limit Itself to that of the elites who have left us personal tiocuments 
Voting records, police reports, parish registers, property deeds, t 
records, etc , allow us to reconstruct the relationships betsveen dilTcrcnt 
aspects of the social structure — political behavior, violence, religiosity, 
family structure, and the class structure — for periods long past These 
data are forever out of reach of the interviewer of the individuals whose 
thoughts are not recorded like those of their governors, priests, or 
intellectuals To mention some specific data relevant for the political 
sociology of societies that made the transition to modernity in the 
nineteenth and early twentieth centuries, we have relevant data on 
voting, violence, strikes, political crimes, parly and trade union member- 
ship, as well as data on the social structure from censuses, ofilcial social 
surveys, tax, educational, and religious statistics, etc. The possibilities 
for correlational analysis, ecological estimates, analysis of variance, 
factor analysis, etc , for data extending over a period of 100 or more 
years are practically unlimited for most Western countries The study 
of strikes m Britain by Knowles’* and a pioneer article by Hobsbawm” 
show the range of generalizations that could be based on such data 
Such an analysis combined wuh the study of trends for such ecologically 
defined units over long periods of time would permit us to study more 
efficiently the processes of social change, tracing (for selected districts) 
the relationships between behavioral and altitudinal changes, and 
structural changes like urbanization, industnahzalion, the opening to 
communications, increases m literacy, economic crises, and the diffusion 
of ideas from the center to the periphery (as suggested by the work of 
Stein Rokkan) Such a correlational analysis would allow us to dis- 
cover the time gaps between structural and “superstructural” changes 
(along the lines suggested by Marxist theory or the “cultural lag” 
theory of Ogburn), or situations like that described by K. Mannheim 
m his essay on Conservative thought:” the diffusion of ideologies (or 

“ Kenneth Knowles Strikes A Study in Industrial Con/Jicl, mih Special Reference 

to pnmh Experience between I9H and J947 {OxTord B Blackwell, 1952) 

E J Hobsbawm, * Economic Fluctuations and Some Social Movements Since 
1^, m Labouring Men Studies mibe History of Labour (London WcidcnfeldA 
Nicolson 1964). pp 126-137 

D ‘TT’c Mobilization of the Periphery .’‘'m Stem 

Kokkan (ed ). Approaches to the Study of Political Pariictpalion (Bergen The 
Michclscn Institute, 1962), S Rokkan and H Valen, "Regional Conirasls in Nor- 
Politics m Allardl and Utlunen (eds ), op cil 
Karl Mannheim ‘Conservative Thou^l, m Paul Kccskemeti (ed ), on 
Sociology and Social Psychology (New York Oxford University Press, 1953), pp 

74-164. especially see pp 120-125 
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institutions) before the structure to uhich they should respond his 
actually developed The latter situation ma} be quite frequent m some 
latecomers to modermiy 

Using such ecological units « wouW be poswble to develop a kind 
of ecological panel following the model ofthc work of Lipsct on voting 
m Southern counties m the United States before the Civil War,’° in 
ihe secession rcferendi and afienvard In this way it is possible to apply 
to ecological units the logic of panel and multivariate analysis and to 
explore the impact of great historical events and crises on communities 
or districts of different predispositions, at the same time taking into 
account their social structure 

The “contemporaneity of history*’ that soaal sctentKls often discover 
belatedly m their research makes ihisadvantage particulirly important 
If iheresearchoncontc/nporari behanorami social structure cannot ignore 
the u eight of history u cannot ignore ecological data 

Ecological data acquire particular relevance for cross^natmal com.- 
parisons of sequences of social and political development Overall 
national comparisons suffer from the disadvantage thit few if any. 
countries as a whole find themselves at the same stage of social and 
economic development in a comparable historical context, while parti* 
cular regions or communities uithm several countries may find them- 
selvcsin asimilarsituation Thefcrofecomparingiheimpactofdifrerent 
institutional arrangements (political, legal, and religious systems) m 
corresponding social structures would be possible 
Such cross-national comparisons of smaller units, as well as com’ 
parisons of intranationaldiffercnccs of several nations,** would contrib- 
ute more substantially to our knowledge of social change than 
comparisons between nations as a whole Such a comparative ecological 
approach to the social, economic, and political history of France, Italy, 
Spam, and Portugal — countries that share such a large historical and 


M Lipset The EmefKCTVce of ibe One Party Soulli TbcElcciionof tStO,- 
In /’ti/u/fo/ \Ian pp at Chap XI PP W4 354 
»>J LmrandAmandodcMiRUcl WjilWn Nairt*n OjiTfrenccvanifConTpanwrfv 
The Eight Spainv in Richard L Memn and Stem RoUan <cdi » C.^mpanng 
teotions The Vse of Quaniuaiiie Data bi Croti \aihntil Fe^forth (New lUvetv 
Yale Universilj Presv 1966) pp ^67 319 Aho LnL Allanil I/rrJ.Mi.onv of 
VV nhm Nation Varutiom and Regional Imhatanm for Crow National Ke^rsn. 
inIbUf.w 33T-34? for exatnrlcj of how “exmimgcno — m lerniv of di' ercnl 
joemCTOgmphc conievls - efrcci* are isnoted cr relaimnvhirv miMmcrrnrfcd 
naiional averagw afvo the sceimn or. aieertarnme 'tw J miiv of '« 

Hayward R AUrr.Jr, The Companwn of Aggircaic Ptif ncal aid S^xiaMlJU 
PotemiaIit.es and Problems.- in Stem RoVUn led ) Coo’paronre Ratatih Aiwi 
Cuharrs anJ Satlom Moulwv I96S) 



100 


JUAN S LINZ 


msrnut.onal her.tage, but whose dcsclopment, both 

pohttcal, proceeded at such dtHcrent paces -would destroy many 

myths derived from over-all cross-national comparisons 

A particular problem revealed by survey studies that could be clanficd 
by ecological research concerns the continuities and discontinuities in 
party ,denuJicat,on as a determinant of political attitudes and behavior, 
independently from initial or subsequent changes in the social structure 
Such identifications of many districts ucrc made in particular historical 
crises and have persisted ever since The problem of such coniumttes 
and discontinuities in some districts, which at first sight seem to ha^ a 
similar social structure, has intrigued social scientists (This has been 
noted, for example, by Key and Mungcr“ as a factual basis for ques- 
tioning some of the more simplistic social determinism that has crept 
into voting research using survey techniques ) Only long-term ecological 
research can contribute to our knowledge of the problem of continuity 
and change in politics This knowledge would be crucial to an under- 
standing of the politics of countries in which the democratic political 
process is interrupted for a long period of time or of situations in which 
some districts respond quickly to changed political appeals by parties 
and leaders while others remain loyal to their first lov c, even when cither 
parly or leadership composition and/or the political appeals have 
changed 

In such cases of continuity in political loyalties, the question that 
ecological data will allow us to pose more adequately is Why did forces 
dominant m large parts of a country persist in some places, while in 
others they were displaced or absorbed by new movements and parties'’ 
The comparison could contribute to a belter understanding of the orig- 
inal strengths and weaknesses of a movement that is being reduced to 
marginal survival, as well as of the factors underlying its persistence 
The ecological data can give us important leads on how to explore the 
conditions for local fractionalization of politics versus nationalization 
of politics through absorption of marginal elements by dominant 
parties Such patterns also can tell us much about the social structure of 
a country, its cultural and social integration, later to be explored more 
intensively by survey data The one-sided focus on class cleavages, which 
certainly are dominant in modern societies, has led to a neglect of 


” XrP Mungcr. * Social Dcienninism and Electoral Deci- 

sion The Case of Indiana,’ in Eugene Burdick and Arthur J Brodbeck (eds ) 
American I of Ing BehoMor (Glencoe Free Press 1959) pp 281-299 

continuities and discontinuities m political behavior, sec Lazarsfcld 
ft aJ , “The Psychology of Voting,* in Lindzey, op cit 
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lertical cleavages, often correspondmE lo {or strongly cnrrclalcd wlh) 
spatial alignmenls Ecological data covcrins a long time spin ran give 
us many cues for the study of tlie factors tliat determine tradilionalism 
m a changing society and that are lilcly to be related to patterns of 
social mtcgralion,” organreational strength, etc , but are often neglected 
in survey research 


SITVATIONS FAVORING SURVEY RESEARCH 

Survey research has the unique advantage that u allows us to obtain 
much information about the same individual rather than only about 
aggregates of individuals, and to group those individuals according lo 
our interests rather than in a predetermined \say as in much of the 
precollectcd ecological data for admmis(ruti\c units Furthermore, 
surveys permit us to introduce a large number of infenenin^ tanablcs, 
particularly psychological variables that vve never would have in ccologi- 
cal studies, between the information about the social position and the 
recorded behavior The relationshipbctwcen independent and dependent 
variables does not have to be inferred mdirctely by the researcher but 
can, at least m part, be ascertained directly b> using the inforniation 
the respondent gives us about h« attitudes or predispositions, his 
perceptions of the reality alTeciing his bchtvior. his motives as he secs 
them, the inferences he himself makes about the relationships between 
his position and his behavior Not only can we relate the informatJon 
about his position to that about his behavior but we can study lo what 
extent the perception of that social position accounts for his behavior 
To pul It graphical!}, with ecological data it would be almost impossible 
lo study “false consciousness 

Thesubjective reality — wh.al is real in the minds of people — is also 
real in its consequences, and that realit> will generally escape us when 
we use exclusively the usual aggregate data about people on which 
ecological analysis is based Certainly survey research maj miss the 
unconscious motives of people, or their subjectively honest rationaliza- 
tions, to which psjchoanalysis has called i>ur attention Hut these are 
even more likely to he neglected in ecological approaches Often the 


For Mampte. « «ouId be pwbic lo rtbie ihc daia on ihc vk of s«ul 
isolaics •• definc^l by ihc fa'queno of coiKaitgumows marr, ifcs. ihe 
ofC, Dahiberg, MaihMrul /-r Crnrw. (Lendon Inter 

science Publisficrs I'M?) and J w 

popubnon winwnum. 6 (1951), r? retat/ftc 1^ lo 

acceptance or tes.Mance lo chance le. »«* ownnwm and d.,coni.nu.i« In 
potiucat toyslties 
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comparison belvscen ecological data for aggregates and those for 
ind.Mduals m those aggregates inevitably points 
intervening variables that foree us to modify the inferences « " ™ “ 

tempted to make on the basis of ecological data . 

have a region with a high per capita income in a countiT m "''ich the 
income has been rising but we do not find this reHected in answers to 
questions about improvement in standard of living over the last years, 
we can suspect that the perception of reality is affected by frames o 
reference, reference groups, etc, which in turn may be related to the 
ideological orientations of people Such variables would explain wy 
inferences about economic improvement and change in political be- 
havior may be misleading The empirical test of such thcorclica 
constructs, however, will be possible only through the use of survey data 
In the majority of cases the inferences based on survey data arc at 
least as good as or generally better than those a social scientist can make 
about the relationship between social position and behavior on the basts 
of ecological correlations Certainly those inferences are less likely to be 
vitiated by a rationalistic bias than those based on ecological data 
Often the same objective (in the view of an outside observer), social 
position, is dinerenlly perceived by those in the specific social position, 
and those differences m perception account for difTcrcnccs in behavior 
that generally would elude ecological explanation 
The direct rather than the inferential linkage bclvkcen individual 
social position and behavior is the great advantage of survey research 
over ecological analysis The exclusive reliance on objective data 
characterizing aggregates without inicrvening subjective variables that 
underlies the inferences based on ecological data makes them particu- 


larly susceptible to the dangers of the ecological fallacy, even when the 
inferences based on aggregations of survey data arc subject to compar- 
able fallacies 

We have already noted the possibility of using ecological units for a 
kind of panel analysis of aggregate data to explore problems of change 
over time (In fact, the first long term attitude change studies, by 
Newcomb, of classes of Bennington College girls were based on turn- 
over tables of data on aggregates rather than individuals, and in this 
they were very similar m their logic to ecological panels One of the 
difficulties with such comparisons along the time dimension of ecolog- 
ical data IS that they assume a certain continuity m the composition of 


M Newcomb, Personality and Social Change (New York Dryden 
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the population of the units under attalj-sis, either of the individuals or of 
some characteristics of the population, if they csiend over more than one 
generation 

The relationship between predispositions as w^l! as the exposure lo 
events or stimuli and changes m attitude or behavior is possible to 
assess only wi th sur\ eys made at two points in time Only (he a\ liability 
of records with information about the same individuals at two points 
in time (such as registration of voters, academic records, etc) would 
allow a comparable study of change to the panel analysis of suncy 
data 

Survey data are undoubtedly threatened by the possibility of dis- 
honesty among the respondents and those collecting the information 
However, in most cases this problem can be exaggerated in dealing 
with large aggregates of randomly selected respondents More than 
cutnghl }y(ns the ptvir!cfn rs crasrw responses TVte no answem. no 
opinions, refusals to be interviewed, when they exceed a certain level, 
have to be treated as distinctive responses that may often stand for 
quite well-defined attitudes discernible by relating them to other 
responses m which people did not feel the same need to be cautious or 
reticent For some patterns of behavior about which we have both 
survey and ecological data, the discrepancy in itself can bean important 
factorforresearch and contributes loourundcrsiandingof the phenom- 
enon Certainly, if m a country we find communities where the 
ofilcially recorded Communist vote is much higher than the number of 
voters who, in a survey, are willing to admit havmg voted Communist, 
while in other towns the two sets of data coincide, we may ask ourselves* 
What distinguishes the confessed and the discreet supporters of the 
parly? Probably the significance of being a Communist is not the same 
m those two types of community In this case the data alert us to the 
existence of different types of Communist voters The comparison of 
both types of communities vviH tell us something either about the 
probable characteristics of those voters — their occupation, their 
religion, their past voting, etc — or about the community structure — 
the mihcu m which those voters live —and thereby about characferisiics 
affecting ihctr responses, such as the importance of social coniro)s and 
pressures for concealing an unpopular choice or loyalty to party in- 
structions not to answer We would expect the first in small ruraf 
communities, areas of strong religious and clerical tradition, and the 
second m strongly Communist urban industrial districts 

Finally, let us note that the same dangers of dishonest) threaten the 
original cwilogical data These data, after alt, arc often collected by 
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covemment agenaes, usmg methods no d.lTerent from those of poll.ng 
agences Often the methods arc relatiscly unsoph.st.eated ^ 

respondent to feel even more strongly that 
negative eonsequenecs for him, particularly in the 
oLmment agencies cannot always be trusted more than private 
organizations or social scientists in collecting accurate information 


CIRCUMSTANCES FAVORING CITHER METHOD 
IRRESPECTIVE OF ADVANTAGES IN PRINCIPLE 

Sample surveys are subject to a number of practical ditncuUics that 
may force us to turn back to an ecological analysis, even when they 
would provide us with more and better information Wc have already 
dealt at length with the case of historical data and the study of rural 
social structure To those categories we have added all the societies in 
which there are no competent survey research organizations or which 
would not allow the researcher to carry out surveys but which have 
published a wealth of data for relatively small administrative units If 
ViC were to rely only on survey research, vve would be obliged to leave 
out of comparative social research many countries and problems that 
otherwise can be studied, even though to a limited extent 
Survey research, for reasons intrinsic to the technique itself and 
connected with the organization and financing of research, often 
cannot be used to study the impact of certain events or crises But often 
ecological data become available to rcfiect such impact There are also 
certain social changes that occur so slowly, almost imperceptibly, that 
even a panel analysis would not be able to pin them down But long- 
term ecological data would clearly reflect them Wc have the feeling that 
some of the changes m demographic behavior in the last decade would 
fall into this category This probably would be the case with fertility 
patterns of different social groups 

There are also data that the individual with the best intentions could 
not give us with exactitude because the information is not available to 
him, though official records contain it An example would be a study of 
the health conditions of a population where the morbidity and health 
statistics and the records of hospitals and doctors may give us better 
information than individuals or their families and where many of the 
factors accounting for the situation (such as number of doctors avail- 
able, health insurance affiliation, number of beds, and economic 
conditions affecting health) arc easily available for aggregate units 



ECOLOGICAL ANALYSIS ANt> Sl/RVE> RESEARCH 


105 


Ecolosical cori-elaliiMs for such dala may be the most helpful in an 
CApJoratorysJtfdy’^ 

A problem not easy to solve is the case where in addition to aiiitude 
data we would like to use a large number of precise data of a census 
type dilDcult to obtain because of the time it would take For example 
m a study of rural social structure we would like to relate sociological 
data and attitude information with \ery exact data on land use, crop 
rotation, machinery used, labor inputs, cattle, production, etc, of the 
type collected in the lengthy questionnaires of the agrarian census or tax 
records The combination of both sociological and agrocconomicdata in 
the same questionnaire interview becomes prarHro//> impossible Often 
these data are available in public records, o/Ticwl statistics, orspcctahzed 
surveys With some indicators of that type in the survey (asked in a 
form identical to that of the ollicial staiisiics) and data for small 
enough ecological units it might be possible to circumvent this difficulty 
to some extent 

Ideally we should be able to combine the information obtained from 
altitude surveys, which are focused on more strictly sociological 
problems, with the more exact records that different administrations 
carry about those same individuals A certain limiting case is not likely 
to occur in the immediate future in *Jiberar’ societies because of public 
resistance, and probably not m planned societies because of bureau* 
cratic competiuon In this case it would be possible to accumulate all 
the sociologically, economically, and otherwise relevant information 
on any single indn idual (or organization such as the business firm or the 
hospital) under a single number We can dream of and fear a situation 
in which It would be possible lo pul together on an IBM tape all the 
population census information the social security admmislralion data 
(with the complete employment record), the income tax statements, 
health data military records, as we)! as academic career data including 
those of psychological tests administered in thecourse of the mdividuil s 
life It requires little imagination to realize the studies of social mobility 
that this would make feasible If. then we would ask m survey studies 
on politics, values, etc , for the identifiration number, there would be no 
difiicully m combining both sets of data and exploring with incredible 


For an cxvmpte see Fundocion Fnmenio de Esiud.os ^lalcs Y SN.olfig.a 
Aplicada In/orwe social igico safire /« situmiAi saciaf dc Cs[mm Eui^ 

amenca 1966) .raltzed by DATA &A under the direcuon of Amandodc Miguel, 
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Some of ihose dala iben arc related lo those obtained from a national sample survey 
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mecsion hypotheses like those about the dilTerencc between 
Ltus pohb« or status mcons.stene.es Teehn.cally, ^ 

w.th.n our reach .n the United States the social security 
serves to record all changes .n employment .s now used m all ncornc 
tax statements (which include information on ‘""f ’ 

certain types of expenditures, etc ), and great universities have begun to 
use the same number to keep academic records The ' 

attitude studies could probably obtain that same number and make me 


combination suggested 

However, in the immediate future such data from public records wil 
be available only in aggregate form, assuring the anonymity of indi- 
viduals, cither for certain strata or for ecological units or. most fre- 
quently, a combination of both The smaller such units (occupational, 
income, and age groups) and the ecological units (cities, counties, 
census tracts, etc) are, the greater the possibility of combining them 
With the data from surveys for individuals In some countries (like the 
United States) the social scientist not only will depend on the published 
data but will have access to the original IBM cards (\Vithout identifica- 
tion of individuals) or to samples of such data to do his own analysis 
Certainly such data analyzed for ecological units in a way comparable 
to the survey data will facilitate their combination This will not allow 


the matching of individuals in the official records and those inter- 


viewed in altitude surveys of a private character But certainly the com- 
bination of aggregate data from both sources will become increasingly 
feasible However, this will require important changes in the design of 


survey research and the construction of questionnaires 

An effort could and should be made to obtain data from government 
and other records to combine them with those — generally attitude data 
— acquired by academic social scientists through surveys This would 
require certain safeguards, like eliminating from the newly created 
record (tape or IBM card) any information that could lead easily to the 
identification of the person, as well as agreeing not to make the attitude 
data available to those agencies that might take advantage of the 
records Limitations like those restricting the publication of statistics 
for groupings of less than a certain number of firms should be established 
in these cases to protect the rights of individuals or organiMtions 
In this changing context the researcher accustomed to working with 
attitude surveys, particularly those coming from a psychologically 
oriented tradition, will have to pay more attention to the work of 
human ecologists, urban and rural sociologists (human geographers in 
many European countries), demographers, and economists A thorough 
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knowledge of government rcconis, population censuses, and other 
ofTicjal statistics, the methods used in thcir coKcclion, and iheir serious 
weaknesses for sociological analysis will become more important It 
should not be foi^ottcn that the otGcial character of such data, the fact 
that they are based on cnumenition rather than sampling, is no guaran- 
tee of their reliability and validity and that before using them one should 
be aware of their limitations as the experienced survey researcher is 
aware of the limitations of his data 


T/IEORETICAL REASONS FOR COMBINING ECOLOGICAL 
AND SURVEY DATA 

The combination of both types of data grows not only out of the 
complementarity and the need for a better use of available information 
about most societies but out of an important change m theoretical 
orientation m sociology TTie advance m knowledge, partly due to the 
accumulation of data, as well as of cross-national research, has made us 
aware of the need to specify further many of our generalizations by 
taking into account contextual variables, milieu, institutional settings, 
absolute and relative rales of change, reference groups, membership 
groups, etc , in understanding individual altitudes and behavior 
StruciuraL/unctional analysis, on the one hand, and the emphasis on 
social interaction, on the other, have stressed that the individual and his 
behavior cannot be studied in isolation from his social context^^ and 
from his interaction with others Sociometnc data allow us to place the 
individual m a network of social interactions, but they do not take care 
of the more diffuse mHuences of the milieu** or of global properties*’ 
of the milieu, of whose influence the individual might not be aw are even 
when they affect his attitudes and behavior Persons in apparently the 
same “objective” situation — m terms of occupation, skill, income, 
education, social origin, religion, etc — will think differently about 
their position and react to it diffcrentlydependmgon thcsocial context 
Corrfext rs cfeif neif by sttuirates sach as wkeiher those so placed are m a 
majority or a mmority m their community, whether they work in large 


Robert K Merton, 5ecia/ Theory and Soaot Siruuure (Gtcncoc Free Press, 
rev ed , 1957} see Chap J "Manifesl and Uleoi Functions, ‘ pp 19-«4 with a 
bibhographjcal postscript on sirwiural functional analysts See pp 7_-73 on 
“sifuciural coniext ” .u -«.* 

« On the concept of reference gfoop, milteti etc sec Conlribulions lo the Theory 
of Reference Croup Behavior.” and * Continuiues in the Theory of Reference 
Groups and Social Structure,” m ibid, pp 28I-3S6 See p 3M on roiheu 
** See references in footnote It) 
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or small plants, vrhelher th=> are present in d.fTerent 

Ihev interact or are isolated To emphasize this contextual 

elOTcnt in social reahtj ssas one of the great 

DurVheim-s Suia* Such a context can affect the 

consciously or without their awareness, a difference that ofte 

coincide with the interaction with others as a mediating factor or a more 

indirect influence of the milieu 

In the ease of interaction with others and awareness of the impact oi 
that interaction, we speak of “interpersonal inffuence,” opmiOT 
leadership," “pressure toward conformity," etc These are likely to 
rcNcaled by survey research in a community context, particularly when 
sociometric techniques are used Thesituations where there is interaction 
but no av-areness of mnuence on particular attitudes or behaviors^e 
often more surprising to the social scientist but no less important The 
concept of reference group is often useful in accounting for these 


situations 

The situations where there is no direct interaction also are generally 
those where the individual is not aware of how the social, institutional, 
and organizational context, and even the physical setting of his life 
afiect his attitudes Obviously there arc situations in which theindividual 
can be aware of how he reacts to his environment without forming his 
opinions through interactions with others in it Ecological data are 
likely to reveal such compositional eficcts but unlikely to distinguish 
these four kinds of situations Ecological approaches therefore are more 
likely to point toward the need to search for structural effects than to 
account for their presence 

Even the fact itself — independently of the awareness of it — that a 
great majority or a small minority m a community (or organization) 
holds the same opinions will affect the social relations of those holding 
them and their “way of holding** their opinions, degree of conviction, 
resistance to change, etc In other cases the social context, of which the 
ecological context is a particular case, will allow us to redefine “ap- 
parently** equal social positions as difTcrent, without the need for many 
additional questions For example, a railroad worker of the SNCF of 
the same category, income, age, mantal status, etc , in its Western or its 
Southern division is basically in the same social position as his counter- 
part m another division but m a difTcrcnl context This will be reflected 


Hansn C. SeUin. “Durlhcjm* Sairife and Problems of Empirical Research 
Jountaf c/Soria/ffrf.6J (t9SS) pp 0)7 619 Also see S M Lipset and 
Neil J S-Tcher. Sochfozy 77-e Progrtit of a Dtcode (Erglcv-ood Oiffs. NJ 
I*Tcn'ict Hall. 1961). pp 132-145 
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m his vote But the AndaJusian farm laborer and the one Ijvrng in 
pstdle are really in two basically different social positions, masked only 
by a superficial classification based on occupation and income We 
should be clear when we are speaking of context in a stnet sense and 
when wc use it as a shorthand It> define more precisely different social 
roles that are only apparently identical in a standard national sample 
suncy 

In practice, the distinction we just made often is very fluid, and to 
decide whether we are confronted with one or the other case might not 
always be important Confronted with the second case, we shall have to 
use more refined categories, additional information for the individuals 
studied, in future research when we might not ha\e contextual informa- 
tion (for example, ecological data) to correct our overly simple cate- 
gorwation of social reality It might ser\e as a rule of thumb that when 
the differences in attitudes or behavior m different contexts of those 
classified m a certain categorj* are much greater than those between 
categories (such as broad occupational groups), we may want to rede- 
fine the classifications used Strictly defined contextual effects should 
generally be minor We arc more likely to be confronted with them 
when we find differences m the same direction m all or most groups in 
Ihe same contexts, e>en when they are also situations where contingent 
effects are involved 

The information about the social context, including the ecological 
context, can be derived either from prepublished data (censuses, 
statistics) that we combine with the survey data (on the basis of the 
selection of sampling points wiihm such contexts or byour Knowledge of 
the location of those points) or from the aggregation of the data on the 
bam of the mtcrtiews made wilbm units or communities we want to 
consider as a context In both cases wc can use averages as well as 


In a study of Spanish bositKssmen by Juan J linz and Amando de Migud, we 
cbaracienzed ihe business communny of diflereni industtiat centen as more or ks% 
socially integrated — wc could also say cohesise — by Ihe proporiion of entrepre- 
neurs who claimed that ihc majority of their friends were businessmen Theaifitudes 
and bthavior of the men of affairs m those communiijes were diffcreni even among 
those who Ihemselves had a simitar proportion of fnends in business among (heir 
five best friends See Juan J Lmz and Amamlode Miguel loi Empresones ante e/ 
Poder Publico El Udera-go y (os Grupos de hKrdses en e( £f»pcesar,ado £spauol 
tMadffd Insiituio de Esiudios PoHlicos 1966) pp 277-279 for a list of articles 
based on this study in which compansons often are made bet wn eninrpreneura in 
different ernes or indosina) regions For a specific esampte of how the 'nfcgrahon 
alTecis the perception of (he prcsiigc of fi»e ebie occupaiions. including lhat of ^e 
importanl^smwsman skJ J LiniandA *M.guel 
en ^mondo cmpresarta/.’' £emra de Esiudass J^ltkos, J28 and 129-130 (1963) 
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measures of d.spers.on to charactenze the “P™™ “J 

social structure that vse want to consider as a contest Certainly tn 
information obtained by both methods can be used ° 

the same characteristics and their correlation might be 
to gam better insight into the milieu Sometimes a discrepancy g 
force us to question the validity of one or the other, and in other i^es d 
might suggest a discrepancy between the “more objective eco ogi 
indicators and their “subjective" perception or description by re- 
spondents For example, we may characterize a community by the 
inequality in the distribution of land ownership and at the same time by 
the perception of that inequality by a sample of farmers Both data do 
not have to coincide, since for a number of reasons the discernibility o 
the inequality might be different A particular way to characterize a 
communiti is by the perception of consensus or dissensus and to 
compare that perception with the results of the actual responses on the 
same issues (for example, the agreement with certain measures like land 
consolidation by farmers) in that community — in other words, by 
assessing its climate of opinion 

As another example, the crime rate for a district does not have to 
coincide with the perception of the frequency of crime by those living 
there Areas with equal crime rates might be characterized by very 
different perceptions To account for them would be important, and 
without doubt the perceptions arc likely to have an impact on politics 
if crime becomes an issue 


Let us emphasize that ecological data (from precollected sources, 
mainly official ones) and survey data are not always interchangeable in 
defining an ecological context This might well be the case with the 
“latent” memories of the past, which are not easily verbalized or 
forgotten but affect present politics, such as the areas of Finland where 
Communists were shot by the White Guards (used by Allardt),^^ or 
areas where the Depression hit strongly, or which had a particulanstic 
past of anti-Prussianism in Germany, etc It would be interesting to 
compare to what extent, in such ecologically defined areas, the memories 
of the past hase survised and to act upon the present through awareness 
of that past Historical ecologists ha\e pointed to the second approach, 
but Its delimitation from the first is possible only by the use of sursey 
data Certainly there will be cases in which the “weight of the past” will 


t- V^agner Threlens. The Academic Mind (Glencoe 

j discuss this problem of inierchangeability of indices 

LnX Allardi. “Patterns of Class Conflict and Working Qass Consciousness ii 
Finanh Poliucs, in Allardt and Utlunen (eds ), op cii . p 123 
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become evident only through ecological anal>sts. while m others the 
nif moire eoUecti\e is also an indisidual memory 
Another example of a situation in which ecological and sur\cy data 
are not mterchangcable but complementary is in the study of deviance 
andnormatnebchjv/or In thisarexsurvcydataand official data often 
in ecological form would give us very different pictures, cither because 
of evasion of responses or because of the importance of differences m 
perception m cases of conflict of interests For example, the number of 
labor law violations conflicts between workers and cmplo)crs about 
wages, etc , m different districts can give us some interesting information 
about labor relations Undoubtedly on the basis oftheir own experience, 
businessmen cannot generally give us adequate information about the 
proportion of cases before the labor courts that arc decided favorably 
or unfavorablj for either businessmen or workers They can tell us 
their own experience (often too limited and distorted by their feelings) 
and, what is almost more interesting, their perception of those decisions 
The comparison between those perceptions and the actual data for 
ecological units would reveal situations of discrepancy m both direC' 
tions ** The situation where many businessmen beheve that the courts 
are overwhelmingly pro-labor when they are not is certainly not the 
same as when, with an identical pro con ratio, they believe that (he 
courts are fair Labor relations that objectively may be similar are 
Jikclj to have, in both cases, vcrydifTcrcnt political consequences 


SXAMPLES OF ECOLOGICAL CONTEXT DATA IN ANALYSIS 
OF SURVEY DATA 

Lei us exemplify our discussion with some data from a recent 
national sample survey in Spam In it we have (he responses of men in 
different occupational groups that reflect the class differences in the 
society The combination of these survey data with the information 


** For example see ihe attempt lo compare survey and ecological dala on labor 
Court decisions tn Spam JuanJ LmzandAmandode Miguel “El mpresano anie 
fos probtemas laborales RerMa tif Minca Sociot dO IMadnd 1963) pp ^105, 
sec especially pp 93 tOl and on aiiuudcs toward labor representatives In the 
enterprises and pariicipalion m trade union eicciions ihe same authors U repv 
resentactftn sindicat vista por los empresanos Fownio Social 78 (J965) pp 

^ *^Thcsc data were made arailaWe lo the author by the rourtesy of F^dac^n 
FOESSA (Fomento de Cstudios Soaales y de ^lologu Aplicada) and of Dr 
Amando de M.guel Director of DATA S A. who direcied the stu^ and whose 
coKiperation is graiefully acknowledged Neither of these sources is responsible for 
ibe interpreiaijon given here to the data 
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about the economic de\elopmcnt of the province of residence, as 
indicated by the provincial per capita income data,“ allows us to tegin 
to test propositions about the consequences of modernization n 
attitudes, values and behavior One characteristic of a modern society 
_ almost irrespective of political system - is increasing equality ot tne 
atizens before the law Another is the greater emphasis on achiever) 
rather than asenbed qualities, on universalism rather than particularism 
on the greater predictability of life and consequently less feeling that 


success depends on luck 

These propositions v.ould lead us lo expect different responses i 
questions touching on these dimensions among men in comparable 
occupations in prosinces of high and low per capita income In t e 
following tables (Tables 1 and 2) we have divided the respondents into 
se%en occupational groups by their residence in provinces of low, 
medium, and high income In general we can expect some kind of linear 
relation between proNincial economic development and attitudes 
within each of the sesen occupational groups (Incidentally, such 
contextual effects may not always be linear, particularly when we can 
distinguish clearly some of the more extreme contexts in which other 
factors may become relevant, producing a U*shapcd pattern or a 
reversal in certain groups at the extremes ) The data presented (Table 1) 
show that provincial wealth is associated m all occupational strata (if we 
compare low- and high income provinces) with a larger number of 
people feeling that they would receive equal treatment by the police in 
an incident such as a traffic violation ’’ As a matter of fact, in five 


*• The classification of the provinces by per capita income is based on the study 
of ihe Banco de Bilbao Fenla Saeional dr Espana y su distribucion proMnctal 1962 
(B Ibao 1963) p 16 The provinces m the lov* income group are all the Andalusian 
except Sevilla, Eslrcmadura, two of the south central meseta, and three of Galicia 
Tbe high-income proinces arc Madrid, Barcelona, and Gerona m Catalonia, 
\izcaya and Guipincoa m the Basque country and the Baleares In this case opera 
lionally defined units coincide largely sMth actual geographically contiguous pro- 
vinces 

'^Th' question t*as taken with a slight modificaiion — leaving out the second 
example or a “miror violation — from the five nation study by Gabncl A Almond 
and Sidney Verba. The Cme Culture Pohtieal Attitudes and Democracy in fire 
Aaffonr (Princeton Princeton Univenity Press 1963) See pp 106-114 for a com 
pa-ativc analysis of the findings The percentages expecting equal treatment in Italy 
an^g those with no education and with some university education u«e 27 and 74 
and 6*1 among those with some secondary education, in Slexico the proportions, 
^ no-education and university-educated car 
^ w compared with the farm laborers and free professionals m Spam Ir 
tne United Kingdom those with pnmary education or less were 8S per cent tho« 
TO 96pcrcent,inV.estGermany they were respectively 

70 and 88 per cent. 
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groups the relal,oush.p ts a hnear one If we had the 
occupation, we could not be sure that they would not be due to the 
dilTeLt proportions of those engaged in them who live m 
and underdeveloped regions If we had only the data ‘ J 

regions of unequal development (see bottom of the tables), we cou 
never be sure whether they were due to the different proportions 
privileged or underprivileged strata living m them The data as presente 
allow us to slate that both occupation and economic dcNclopment are 
related to the perception of equality before the law in Spanish society 
Certainly in some strata the expectation of being treated unequally is an 
expectation of disadvantage, in others one of privilege (We may note 
that in such an underprivileged stratum as the farm laborers in t e 
underdeveloped areas a larger minority expects favorable treatment, 
perhaps counting on the weight of particularistic friendship tics) 
Among free professionals, small businessmen, self-employed workers, 
and farmers, their status m the less developed provinces seems to assure 
them a more privileged position The same is true for white-collar 
workers and civil servants and bigger businessmen, if we compare their 
responses m low- and medium income provinces with those m the most 
developed ones Another indication of the importance of the contextual 
factor is the minimal proportion without an opinion on the question m 
developed Spam and the almost consistently higher proportion in the 


low income provinces 

It might be argued that some of the contextual effects that this 
combination of ecological and survey data has allowed us to discover 
would disappear if we redefined the occupational groups with greater 
precision Certainly the categories “farm laborer” and “farmer’ 
comprise, m different regions of Spam, people who should not be in- 
cluded m the same occupational grouping With more cases in each 
group we might have gone a step further by taking within the 21 groups 
we are considering only those with an almost identical income However 
justified this may be in analytical terms, by doing so we risk losing sight 
of the social reality that these different provincial milieus represent, 
even when they m turn are a reflection of the different occupational 
structure that is associated with economic development, as revealed by 
the absolute figures m parentheses, on which the percentages are based 
In underdeveloped Spam, fewer people in all strata believe in full 
equality before the law, but the larger proportion (19 per cent) of 
underprivileged farm laborers in it must contribute to that climate of 
opinion and to the police mores that the answers may more or less 
accurately reflect 
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Quite consistently with the basic homogeneity of industnal society 
and the great heterogeneity of rural social structures uc find that the 
differences by context among manual workers are relatively small 
compared with those among farm laborers and farmers The great 
differences in the expectations of free professionals — an occupationally 
relatively homogeneous group — in different social contexts is par- 
ticularly noteworthy Certainly the elite position of the professional m a 
semidcveloped society, as the poorer provinces of Spam can be con- 
sidered, becomes apparent from the data, and ue may add that they 
support our image of the professions as part of a premodern society 
compared to the business bourgeoisie 

Let us stress that these findings would not have been possible without 
combining the survey data with ecological data since it would have been 
difficult if not impossible to derive the economic development of each 
province (or better, sampling point) from averaging the responses on 
income of the respondents in them Naturally, there are situations m 
which the contextual vanablcsc an be derived from the data of the 
survey, particular!) m studies of people m organizations, work groups, 
or elites But this is not likely to be the case in national sample surveys 

The question concerning equality of treatment by the police probably 
reflects m part (he reality, in part differences in values affecting ex- 
pectations, and It IS likely to be somewhat affected by political attitudes 
(as the slight increase of the “worse” response among workers m the 
developed regions suggests^ Another question*’ about the factors 
accounting for success is even more likely to reflect differences in values 
between a developed, mainly industnal society and a more traditional, 
rural one 

Before we enter into a detailed analysis, it might be noted that despite 
the differences in economic development (sec last three rows of Table 2) 
there is an over all national pattern of response Luck still occupies an 
important place, though most marked among the less privileged, 
particularly for the rural population for whom irregular crops due to the 


**On this point — ihe distinction bel»*wn a bourgeoisie based on a modem 
economy and the middle classes {efases maiias) — ‘whose existence and growth is 
largely independent of economic doelopmeni sec Juan J Linz and Amando oe 
Miguel, • Within Naiicm DifTcrmces and Comparisons The Eight Spains in 
Merritt and Rokkan (cds ) op ril , pp 285-29J . , , t 

»»This quesiion is a modified version of the one used in a study of values of 
college students See Rose K Goldsen Moms Rosenberg Robin M Williams Jr. 
and Edward Suchman What Co/fere Studntts ThitX {Pnnevton Van N«irand. 

1960),p 18 The same question has been asked in Spam in a Naiional Youth Study 

and in studies of secondary Khool and uraveniiy students 
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Spanish climate make this a not so unrcaJjsiic answer The tivo most 
clearly untversalistic criteria, hard work and intelJjgencc. e\en hi a 
semidevcloped society, arc already prominent (41 per cen* of the 
responses m the low-mcomc and 51 per cent m the high income 
provinces) The most particularistic response, the reliance on "pull/' 
acquaintance with people of influence and power, the patronage so 
important in traditional societies (and so often noted as an evil by 
critics of Spanish society), is important but not more so than qualities 
that involve an clement of ‘ oiher^onenicdocss ***“ 

The basic pattern of regional differences is reproduced with relatively 
few reversals in all occupational groups Luck, independently of 
economic development, is more important for farmers and farm 
laborers, but it is less important even for them in the more developed 
areas It is also more important for manual workers than for other urban 
occupational groups pariicolarly upper middle- and upper-class strata 
The relatively small differences among workers by economic develop- 
ment may be due to the large proportion of rural migrants among the 
workers m the most developed parts of thecountry Atiheoihercxircme 
we find that the importance of micHigcnce increases with economic 
development in almost all occupations (The only significant reversal is 
among small business people and independent workers, where luck and 
hard work seem to be more important in the most developed areas of 
Spam ) It js noteworthy that while the importance of work increases 
from the low-income to the medium-income provinces (among farmers, 
manual workers, free professionals) and among some groups continues 
to increase from medium to high income, (here is a significant number 
of cases in which it is emphasized less in (he developed regions, particu 
larly among the free professionals and the medium and larger business 
men It seems as if these groups would consider intelligence — and 
probably education — as more important with hard work considered 
to be a traditional petit bourgeois virtue We would not mention this 
finding ifa study of Spanish businessmen, employing over fifty workers 
had not shown (he same pattern by region and size of business ■* * 


Obviously, ihc response la the qucsiion i$ ooty lndicaiiv« of quitiiies ihai mithi 
be described by this Icrm as used by DawdR/eiman mcoHaboranon wilb R Denney 
and N Glazer The Lonely Crovd A Siudt of the Changing Amerlmn Character 
(t^cw Haven Yale University Pros 1950) 

' ' The emphasis on hard work is a lypical response or smalt businessmen (M per 
cent of those cmployms 50 lo 1 00 workers mention tt among fisequalii«« tor success) 
white among the heads of large enlerp«»« (* workers) only ^per ccni 

mention it. stressing more Mhinkfrig ahead and filing along 
Data are f^rem a fonheommg study by Juan J tarn and Amando dc Mrgucl on Ihe 
Spanish businessman 
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As m the case of a national sample, study of youth and «'ationship 
between development and particularism arc not as 
of Talcott Parsons as many students of social change 
expect The same response seems to have a dilTcrcnl meaning f 
occupational groups m similar contexts Among rural groups ^^nowmg a 
patron seems more important m the poorer parts of the countp^ A 
the other extreme of the stratification system - free professionals an 

busincssmcn-wcalsofindthcgfeatestcmphasison particularism in the 

underdeveloped areas (where they also more often expected favorable 
treatment by the police) It is among the urban lower and lower- 
middle classes that we find the unexpected reversal particularism is 
perceived as more important in the developed provinces We can only 
speculate about this response It might be a result of the recent migration 
to the cities of rural people who substitute the belief in luck inap- 
propriate to an urban context — for a more prcdictiv e trust in protection, 
a belief that also would protect thetregos from being hurt by having to 
attribute failure to lack of hard work or intelligence Or il might be 
that the response reflects the disappointments sulTcrcd in relation 
to their expectations of rewards for achievement in a society where 
particularism is still important the larger proportion mentioning it 
reflects, not a greater rate of particularism in the society, but a greater 
sensitivity to it The low emphasis on luck and pull among the most 
privileged in the developed areas may reflect, not reality, but iheir 
greater commitment to the values of achievement and univcrsahsm, as 
well as the psychological need under those circumstances, to attribute 
their own privileged social position (indirectly) to their own efforts As 
Max Weber and Robert Merton have stressed the successful also want 


to feel that they have a moral right to their privileged position 

Let us stress the methodological implications of the preceding 
analysis Without the contextual variable (in this case derived from 
ecological data) we would not have been able to go beyond the impact of 
occupation on the perceptions of the social system We would not have 
discovered that the value system that seems to dominate in a society 
changes with economic development, even for those who do not occupy 
directly the positions resulting from that change The change is one of 


The problem of particularism versus univcrsahsm in Spanish culture society, 
and alt^dcs has been studied using data from a naiionat sample study of Spanish 
youth by Amando de Miguel Estructura social y juventud cspafiola normas 
RevIsta del fnsniufodetaJurenrud / (Madrid 1965) pp 111-144 
^ also articles by the same author in numbers 0 (1965) 5 and 6 (1966) based on 
the same national survey 

** “Social Structure and Anotiue ” m Merton, op at pp 145-149 
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mfJjcu But such a contextual analysts alerts us to the fact that similar 
responses may, m difTerent contexts, result from different motivations 
or mean difTercnl things, and m ihts our data point beyond ourpreMOus 
example Some of the rcsersals in the high-income provintss and the 
difTcreni patterns m various occupational strata m those provinces 
point in that direction (\Vc would not be giving such emphasis to this 
finding if we had not encountered similar patterns in two other in- 
dependent studies ) Here we are dealing with what Peter Blau has called 
contingency cfTects,** a t>pc of structural effect that he describes as 
cases where the structural effects, * instead of having an effect on a third 
varnblc that is mdependent of the individual s value orientation, may 
determine whether the individual's value onentation and a third % anablc 
arc related or how they are related ’ In the case wc arc discussing the 
individual’s value orientation — for which we have no information — 
IS his own attitude toward the universahsm particularism dimension, 
his valuation of its legitimacy, that may be different among people in 
the same occupation in different regions 
If we wanted an example of how basic social positions have a different 
subjective meaning in different contexts, the data about subjective class 
identification (Table 3) of seven occupational groups in three types of 
provinces by economic development would be sufficient The most 
striking difference can be found in those from the lower-tncome and 
lower-status occupations who identify themselves as “poor” rather than 
‘*working”-cIass Certainly a farm laborer or manual worker wlio calls 
himself “poor" is likely to have a different view of society and politics 
than the one who calls himself working class Leaving aside those who 
arc nothing but poor m a strict soaological sense, that is, those without 
an occupation, the poorly paid have the choice of seeing themselves as 
poor or working class, as poor or as workers, their expectations and 
demands are likely to be quite different Undoubtedly, the long lime 
weakness ofthe Spanish Socialist Party m areas with a large proletariat 
and Its strength among workers in developed and scmidevcloped areas 
arc not unrelated to this subjective perception of class position The 
proportion of those m the more privileged strata who are willing to 
consider themselves “working class” ako increases with economic 
development These findings about the greater willingness to identify 
oneself as working-class with economic development is an mlranation 
replication of the same finding when we compare the answer to an 
identical or similar question m nine Western countnes — with 93 per 


Blau, <7^ cif.PP ISJ-IW 
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cent of Dutch workers so identifying themselves, 57 per cent of a 
Mexican sample, and 85 pet cent among Spaniards ** 

In the examples presented we used an ecological vinable as a variable 
m the analysis of survey responses, but there is also the possibiliiy of 
analyzing ecological data on the basis of information derived from sur- 
veys In a study of Spanish businessmen** we obtained the prestige 
ranking of five elite occupaiions and calculated the average rank of each 
among the businessmen in a number of provinces From various sources 
— biographical data lists of civil servants, census data — we were able to 
obtain the proportion of members oi the elites born in dilTerent prov- 
inces and calculated the extent to which those born in them were over- 
or underrepresented m relation to the population of those provinces at 
the time of their birth This allowed us to correlate the ranking of 
provinces in terms of prestige assigned to various occupations and ihcir 
ranking in terms of iheir recruitment This effort to relate ecological 
data to survey data served to test the hypothesis that prestige and 
recruitment into occupations are closely related Certainly data on the 
recruitment of elites cannot always be obtained by surveys, particularly 
national sample surveys, but they are available in many sources giving 
the place of birth On the other hand, the prestige of occupations in 
different milieus cannot easily be assessed except by survey data 
Bringing both together permits us to reach new conclusions Survey 
analysts should therefore be more concerned about the type of ecolog- 
ical data that would allow them to test additional hypotheses for which 
they might not have data m iheir surveys To give another example, it 
would be diflicult to obtain m surveys information on violations of 
norms, but it is not dilficuU to obtain opinions about the legitimacy of 
norms, about the degree to which norms hurt the interests of people, the 
extent to which people would condone violations or be puniiive toward 
violators, would fear punishment, or would favor change in the norms 
It seems obvious that we could more often design research about 
deviance from norms relating such attitudes to the actual rates of 
deviance recorded by the police the courts, or administrative agencies 
for the same groups in specific communities or administrative units 
What better test is there of the sigmficance of attitudes toward labor 
legislation than the number of violations in industrial communities 
where different altitudes predominate’ Such studies could belter 

‘'Data tjuoied m Fundacidn FOCSSA fnfimw jocfahgito la 
tocial de Espana op cn 

** Ltnr and Miguef ‘ D presugw de profcsxwies en el mundo cmpreMf's*. 
op fit pp 56-64 
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determme the importance of vanotis attitudes m encouraging or 
deterring from deviance 

PROBLEMS IN THE COMBINED USE OF ECOLOGICAL AND 
SURVEY DATA 

If the advantages just mentioned arc convincing, the usefulness of 

combining both types of data should not ,X'"uU s 

about the feasibility of doing so in many cases and about the dillicullies 
involved Certainly both the orthodox survey design and the normal 
construction of questionnaires will have to be modified, and the more 
refined ecological analysis may have to be modified to an extent tha 
many ecologists will find disappointing 

Generally, randomly selected sampling points proportional topopula- 
tion will not be adequate to represent ecological areas defined by socia , 
economic, and other characteristics, particularly if some of the most 
distinctive ones are sparsely populated On the other hand, some 
heavily populated ecological types would be more than adequately 
represented in a national sample An oversampling of the first and an 
undersampling of the second might be the answer, with a weighting of 
interviews to obtain, at the same time, a representative national sample 
The oversampling of certain areas and types of communities will 
always be necessary if we want to analyze the impact of ecologically 
defined contexts on the attitudes and behavior of individuals or organ- 
izations located m them (The only other alternative would be extremely 
large samples ) This need becomes even greater when we want to 
explore structural effects m a particular community with its global 
properties (for example, the long-time leadership of a particular 
politician) or properties derived from altitude distributions within the 
community (the “climate of opinion”) If that information is to be 
obtained m the same survey, it should also, in principle, be a sample 
representative of the community These difficulties will be compounded 
if through sociometnc techniques wc attempt to go beyond an imputed 
climate of opinion (derived from the altitude distributions within the 
community or organization) to the interaction system of the individual 
by interviewing also those persons he mentions as friends, opinion 
leaders, reference groups, etc , since in that case the original sample will 
be expanded m an initially undetermined way The study of attitudes 

Elihu Katz and Paul F Lazarsfeld, Personal Influence The Par! Played by 
iIq Communications (New York Free Press, 1964), see pp 

149-161 on designators, inriuentials, and mfiuenccs (Part II. Chap 1) and ihe 
Technical Appendixes 
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and behavior in ecologically defined contexts by survey methods, 
particularly in a specific set of individually defined communities rather 
than merely a type of community, is not easy within the framework ofa 
more comprehensive (national or regional) sur\ey without loss of 
statistical representativeness by either TTus dilemma cannot be dis- 
missed since without an adequate knowledge and understanding of the 
over-all basic relation between the variables we cannot be sure that the 
differences observed between coramnnitics arc due to compositional 
elTects In addition it would generally be illegitimate to derive state- 
ments about the larger urme/te if the selection of such sampling points 
has been made on the basis of ecological (often purposne) criteria 
rather than randomly or if for ihesakc of concentrating the interviews 
in those places where we expect interesting compositional or general 
structural effects certain types of areas are omitfed as sociologically 
or demographically less interesting The discussion of theoretical 
advantages of the combination of ecological and survey data can 
Ignore these practical problems But unless endowed with an unlimited 
budget, any attempt to make use of (heir combination has to lake such 
problems into consideration (os I quickly discovered when planning a 
research design for an OECD study of Andalusia) '** Several possibilities 
for circumventing this dilemma are open 

1 One could use in the ecologically based studies questions from 
nationally representative samples that could serve as a “base line ’ 

2 One could execute simultaneously wuh many identical questions a 
representative sample survey of the larger unit and the contcxi-onented 
sample surveys 

3 In the case of cumulative research by the same organieation or 
researcher, the information about the climate of opinion in particular 
units derived from intensive research could be treated as a global 
property in the analysis of a smaller number of respondents in the same 
community found in a national representative sample 

4 The partial duplication of previous surveys in ecologically defined 
contexts would allow improvement of the representativeness of (he 
national survey for them, with the addition of new problems particular 
to a new study of those units It would be best to place the findings in a 
national or larger context without the costs of a new national study 

All these approaches involve uneasy compromises The choice to be 

'•Smdy of Institutional and Social Otsl«^o Economn: Development of 
Andalusia (Spam) by a isam of experts of the OECT> 
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made be more or less sat.sfoctory for either the '’/“j. 

contextual analjsis or the more descnptnc (and/or analytical) stu > 
r TargeTunit'particularly of the «hole society For -n ^e 

Andalusia study the sursejs in a smalt number of communities 
uilhin socioeconomically typical areas will be 
community pouer and desclopment aspects and for the rural 
but consideLly less so for the member of the research group 
in descnbing, interpreting, and predicting migration and force 

trends for the region I believe that most compromise solutions betvvccn 
the traditional representative sample survey and the ecologica y 
context oriented surveys will have to use in the analysis rypcJ o 
ecologically defined areas (grouping communities and sampling points) 
rather than focus on indntduaHy identifiable contexts in whic t c 
individuals interviewed would be actually interacting socially or be 
exposed to identical global effects This means that in most cases we 
shall be dealing with some hind of abstract structural elTects rather than 
V. ith actual compositional effects 

The problems of combining survey research techniques ^vlth ecological 
anal>sis become compounded when we consider the secondary analysis 
of already collected data or the use of the national sample surveys of 
public opinion research organizations, which cannot devote so much 
effort to planning ecologically based studies or designing such com- 
plicated samples as suggested earlier In this case we have to limit 
ourselves to asking for much more adequate reporting on the IBM cards 
for secondary analysis of the sampling points so that we shall be able to 
make our own combinations in view of the ecological analysis 

An effort to improsc the representativeness of the samples for regions, 
size of community, types of soaoeconomic areas, etc , by stratifying 
the sample by such criteria would be useful In this case an attempt to 
code m the original study basic ecological information would help 
secondary analysis The availability for each country of a data bank for 

minor administrali\euniis‘’(municipa1iiics, cantons, even departments) 

whose information would automatically be added to every interview 
would be ideal (With the use of electronic tapes this should be neither 
difficult nor costly ) Even so, an ecologically oriented secondary analysis 

ForFinUnd see Allardl, in Memtt 
tod Rokkan (cds ) op clt^ pp 33S-339. for the «nables included m a fUc for the 
550Rnnishcor^^ ForNoroay «e Stan Rokkan and Henry Valen. “Archises 
for Stat«tic:^d«s of W.thin Nation Differences ,n /W, pp 412-418. and 
o Finnish Communism Traditional and Emerging 

Radicalism, t'^ermilonalJoumal of Comparal, re Sociology. S pp 49-72. 
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rvfrfdjfuo ‘wruj cJjfa atttau tncounfcr Jhe (l*frculi> of Jack of 
/rprfxfnuu»cnft» and ifralj ate of a«j pari of a raiioMl samite for 
iintjrooc aieay ot conmunttK^ Only the actuniulanon of ic%cral 
naljoraJ van-p’o *.oH!d thr* d^fnculfj. ^h(^. m turn, uoufd 

tcquifc iha! \u;Jt naiional vamplct be drawn the unie wj), u<jnp the 
^aroe bacVpTi'onJ charaettr jvi»ci fwiib ibf ume c»Hbnp) and ai Ic.tit a 
trummum of unetb idcntKral que^tmot o»cr a reJativeh ihort pertod of 
time. Such an apprt»3ch. inmlttitalJ). would aJwi facjliisJe trend jjudiet 
(Marled ihaeye^ mtr im'c naturally would male llieareumubtmnof 
tntcrvicwx for the ume eodopicAt atCA3 ofduhiout \ 3 l 1 d 1 t).) 

M) fceliRjt i» that t^vul frvanh could male prcaitr ad^anccj if all 
Cirnctf nod would more often cicrifs^e the tle«ire to be onpmal. always to 
ail rew qucttmnt and jo obtain a» mart) attitude data as po«ib!e. but 
rather would repeat more often those queiirom already u«d in the past 
or In anothercoupjr) and oHatn more defadetl “bjclf round urtaWei.’* 
Ibc flccumulaiion ol inier\»tws for ecolopicaJ analysts would require 
ihit, fuftrculaily since for the esploraiion of “composiiioRaJ cITects** 

It n essential to he able to coidtol for a number of baclpround satiabfcs 
ttmuflaneousl) (oceupatmo, education, income, Icnpih of residence, 
lociai intepration. organiraiional irwmberships. etc ) Otherwise it 
would be jnipoiiiblf to know whether the diwreparicres found between 
ihtrse Ititnir tn different social eonteats are due to Iheir differences 
as ptj’ple (either by hackground or by selft'cleclion) or fundamentally 
to the fact of lising m a djirertni conieai or milieu. If m Spam, for 
esampic. we want to compare the attitudes of naior Catalan and 
nasque workers in those lepons with immigrant workers from other 
parts of Spam and with each other, we would, ideally, male the com- 
parium between workers with similar occupational skills, incomes, 
worlmp fit similar plants, etc., to make sure that we do not altnbutc 
tn the •’chm.sttf of laKv relations*' in each region differences due to the 
poiiiion of the workers studied in the social and economic structure of 
each region Tliosc dilTcrencei in position exist and naturally con' 
tnbule to the “climate of labor relations.” But the componential effects 
cart be isofaied only by analytic study of those groups not coninbuiing 
to them, that is, by compartng metalworkers rn both areas, esen when 
the milieu of Barcelona is shaped by the dominant workers employed 
in familv textile enierpriscs.^’ It should be noted that studies focused 


'• for f\amrl« kc l^c hawd on the of Ihc Spanuh businessman. 

wrtwuUrty: Unr and Mifuet. "ttcmiwttfw awe 1« probtema* laboralcs. 
cr r/r.rp-»'2^. 
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on structural efTects may be h.ghly nttslearlmg as 

since groups central to the analysis may well be numerically, Y. 

rrhnirr" TuU and ecological analysis combined wd. 
often advance social science knowledge, the formulation 
and the testing of hypotheses But it also may lead to “ 
of reality An occasional glance at the absolute figures for P°P“'" 
tions represented by those sampled in the survey would easily make us 
aware of the actual significance of groups whose analytical 
IS unquestionable For example, the few peasant owners of olive grove 
in small West Andalusian communities are important for soctoiog 
analysis But this should not make us forget that olive cultivation 
predominates on large nonpeasant farms, in larger communiti«, c c , 
and that it is this dominant social structure that characterizes Western 
Andalusia This difference in perspective is not unimportant in accoun • 
mg for a certain dissatisfaction of policy makers or even of those of us 
attempting to describe a total national society with much of the know- 
ledge accumulated by sophisticated studies 
The combination of ecological and survey research, particularly il 
data from government records (censuses, labor force surveys, social 
security data, immigration statistics, special surveys, etc ) become avau* 
able for secondary analysts, will require the survey researcher to make 
his questions more comparable with those used in collecting these kinds 
of information The census-type questions will have to be asked and 
coded in identical ways, something not always satisfactory since the 
manner of asking them in official forms is often unsatisfactory m the 
interview situation (for example, it may require the respondent to fill 
in tables or give data with a precision obtainable only if the trust in or 
the power of a government is absolutely assured) The experience of 
survey researchers could certainly be used to make official agencies more 
conscious of the shortcomings of their questionnaires from the point 
of view of the interview situation (that is, their inclusion of tables to be 
filled by the census taker without any actual wording of the questions 
asked by him) 


In the preceding pages we have emphasized changes, and often sacri- 
fices, that the traditional survey research approach will have to make 
in Its marriage with ecological research But as in all happy marriages, 
both sides will have to make some sacrifices The ecologist is tempted 
to stress the uniqueness of certain regions as geographicosocial units 
or, if he has a historical bent, to stress the influence of history on socio- 
structurally identical areas, or, if he attempts to develop typologies of 
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socioeconomic regions of a certain homogeneity (particutarly agri- 
cultural regions combining crop data with land tenure systems and 
demographic change data), to arrive at a rchtively large number of 
such units If we consider that those ecological units have to servem 
designing samples for survey analysis and that samples, to be representa- 
tive, cannot be below a certain size irrespective of the size of ihc popula- 
tion sampled, it becomes apparent that the natural (and basically 
justified) tendency to multiply the units of ecological analysis becomes 
a source of strain betw een the research and the colleague who is planning 
a survey on the basis ofhis efforts 

To avoid the divorce of ihe two methods a compromise is needed 
One possible solution is to recombine the more adequate and refined 
ecological area types into broader ones defined only by the mam 
variables Another is that the survey analyst might convince the ecologist 
that, withm any given budget, he can represent adequately only some 
types of areas, ignoring others either m terms of their demographic and 
thereby electoral, political, social importance or in terms of their 
strategic research interest irrespective of demography The methods of 
areal representation used by ecologists coming from human geography 
often make the first approach difficult since only cumbersome calcula- 
tions can transform areas that appear so distinctive on the maps into 
data on population We have the impression that ecological data can 
serve as a basis for designing a survey only after their transfer from maps 
to IBM cards so that types can be selected at mil by using different 
dimensions simultaneously and so that rapid estimates of their demo- 
graphic significance can be obtained Praciically, this means that the 
data from maps or census type statistics have to be coded or classified in 
intervals and that the cards have to include the absolute population 
figures ” Obviously, the inclusion of other absolute figures would allow 
us to obtain an important “by-product” the averages on other variables 
for the types selected (which sometimes could be compared with 
averages obtained by survey mclbods from respondents living in those 
areas) A fruitful by-product of this marriage of the two methods would 
be to make the ecologist more aware of I'ypvJ'Vsgwif a? 
central to modern survey analysis and to make the survey rescarehcr 
more conscious of the complex combinations in which the variables he 
uses analytically are found m social reality It would also make him 


»' In comparisons over lime ihc felalive figures can be misleading for essirptc 
ihe percent of men amve in agncolfure hi a dwinci may decrease «mos sfler emsat 
while Ihc absolute number engaged in farming may be mwasmg with iheconiequeni 
pressures on Ihe land resourm and jwulimg soda! conflicu 
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more sens.t.ve to the demographtc s>En.f.cance of the groups he is used 
to seeing only analytically rather than as numbers of actual people 
,t IS probibte that the use of the ecological data now ." Statistical 
yearboohs or on maps as the basis for the design of survey studies would 
greatly accelerate their transfer to IBM cards and apes 
initiate data banks of a type that was not necessary as long as K" 
analysis relied mainly on correlations or averages for a limited numM 
of types of areas Experimenting with different typologies of socia 
structure in the process of defining ecological areas for sampling will 
require such data banks and will give the ecologist a new perspective 
In some areas like political sociology, survey research in the ast 
decades has become increasingly cross-national (producing studies like 
those ofLipset, Alford, Almond, and Verba, and others) “The question 
may be raised as to whether the more contextual orientation derived 
from a combination with ecology and the various methodologica 
adaptations (many of a practical nature) previously discussed will not 
limit cross-national comparability My feeling is that this may happen, 
but It does not have to happen In fact, an intelligent use of an ccologica 
approach could improve much cross-national research The logic of 
tnternanon comparisons of tntranation comparisons*’ relies on the 
collection of data on a cross-national basis with an ecological perspective 
m mind This requires the selection of samples within countries that 
are not fully comparable but relatively similar, in geographical or 
institutional settings that are highly comparable, on the basis of informa- 
tion obtained a prion on an aggregate basis The impact of the different 
institutional (democracy versus dictatorship, industrial versus semi- 
mdustnal, socialist versus capitalist) and historical factors characterizing 
total social systems on the attitudes and behavior of persons placed 
similarly in the social structure (as individuals living in similar social 
contexts, communities, regions of similar economic development, work- 
ing in similar organizations, etc) can then be studied much better 
Cross national studies of similar social strata or elites in comparable 
ecological settings could certainly contribute to comparative sociology 
as much as do present research designs For example, the comparative 
study of elites in Rome, Madrid, and Lisbon, on the one hand, and m 
Milano-Tonno-Genoa, Barcelona Bilbao-Oporto, on the other (with 


M Lipset, op cll , Almond and Verba, op at Robert Alford. Party and 

Society The Anglo-American Democraaa (Chicago Rand McNally, 1963) 

UM and Miguel, “Wiihm Nation Differences and Comparisons The Eight 
Spains, in Merrill and Rokkan (eds ), op dt , pp 267-272. 
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the consequent oversamplmg m some of those cities)/^ would certamly 
provide better insights mto the elites and the social structure of these 
three areas than one designed to obtain just a representative sample of 
the elites, ignoring the dislmctive role of those types of cities 
In general, an approach oriented toward developing rjper of eco- 
logical areas and aiming at representative samples of such areas rather 
than toward representing dtstmcine comnumiues as such would be 
particularly fruitful for cross national comparisons The focus on actual 
compasitioml effects m this sense could be slightly detrimental to 
over-all international comparative research if no attempt is made to 
link such surveys with national survey data However, as J noted before, 
any tendency to study particular ecological contexts without reference 
to the total national context is undesirable, on strictly theoretical 
grounds, m the study of community climates of opinion and other 
structural effects Ecologically oriented survey research can be an 
advantage, not a detriment, to comparative research 


SUMMARY 

While theoretical considerations— the awareness of context and 
structural effects — and the immense advantage of using the store of 
social and economic data accumulated by most societies strongly urge 
us 10 combine ecological and survey research methods our discussion 
has brought out some of the many practical difliculties in doing so 
The solutions required (unless the research should have at its disposal 
budgets far beyond what is usual m social research) will I fear, not be 
fully satisfactory to the survey analyst and even less so to his statistical 
adviser, and it will distress the ecologist intimate!) familiar with the 
spatial distribution of social phenomena Tlic need to obtain representa- 
tive samples of reasonable size, to accumulate data over time to com- 
bine national with regional or local studies to facilitate a certain degree 
of cross national comparability will oblige os to pay much more atten- 
tion to theoretically sound and empirically usable typologies of social 
siruclurc,** using the most sahent dimensions of social systems and 


Tor an example of a paper bnngmg together diti chanctenzine ihc soeul 
structure of a aiy and proi Mice jn a mlioml ccmiexl. see Amando dc Miguel and 
Juan J Linz ‘ El papel dc Rarcctona ei» la estnrctora socwl expa/loU “ in La 
Froifncia JJimerrsiM^tvrtfnlm/ev (B3rcet(3na Imiilutodc 

Ciencias Socj 3)« Diputaci^n Provincial dc lliTCctcma 19661 PP 2a) 234 

** For an ininat tivt of dimcniion* lo he m developing such a l>7’ology %ce 
LIniandM’Surl Withm NiiionDilTercncwandCpmpativoM The rigMSsumt," 
in Memit and Rollsn (cds ) o/r e*. pp 279 2W. 
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disregarding some of the uniqueness that the ecological approach 
would force upon us In contrast to much of the traditional su y 
research that grew out of a socopsychological 
individualistic and rationalistic bent, the approach 
would have to turn more to macrosociological perspecti e 
such simple conlinua, like Gememscimfl versus Geselhchafl, foltr 
urban, rural versus urban, peasant versus feudal or neofeudal. clas 
versus status oriented, tradition versus modern, industrial versus 
industrial, developed versus underdeveloped, would quickly prov 
insufficient if not misleading To consider them as continua rather ttian 
as dichotomies would be helpful But we suspect that more retineo 
theoretically and empirically meaningful types of social structure are 


needed and can be de\ eloped 

We are all aware of a number of djmensions, neither small nor 
unlimited, that v.e can use to characterize a society or unit within a 
society We should make an effort to agree on a limited number of 
such dimensions and then decide on certain cuttmg-ofT points to pbcc 
societies or units within the resulting typology (Certainly the classinca 
tions in each specific case could be quite refined, but there should be a 
possibility of combining them into a more limited number of categories 
for comparability ) My feeling is that it would be easiest to progress in 
this direction by limiting our attention to countries within the same 
cultural area, with relatively similar degrees of economic, social, and/or 
political development For cross national research it would be ideal to 
be able to sample representatively regions, provinces, or communities 
falling into each of the “boxes’* of such a typology, as long as the 
population falling into each was of a certain minimal numerical im- 
portance By such a combination of ecologically defined units with 
sample surveys we could make more cross national comparisons holding 
constant a relatively large number of variables and more comparisons 
of intranalion dificrcnccs — two types of companson that are difficult 
to make today by using national samples Another gam would be to 
discover the absence of certain types of social structure m some soaeties 
We could start doing this on a limited number of dimensions (between 
five and ten) for half a dozen countries, using the data for provinces and 
major cities and taking the information from published data On each 
of those dimensions we could use more or less complex classifications 
or one based on their positions with respect to the national average or to 
that of the countries being studied By classifying each administrative 
unit and adding up their populations, we could estimate what propor- 
tion of the population lives in each “structural context ” By doing this 
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for a number of census >ears, VkC uould be able to describe more 
accurately the social changes (hat ha\e tak«t place and their direction 
In summary, we believe that an eflbrl to combine ecological data and 
methods with survey research will contribute, indirectly but con- 
siderably, to macrosociolog) 
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Social Context and 
Individual Behavior 


BACKGROUND OF THE PROBLEM 

Theories and aJ hoc explanations m the social sciences tend (o be of 
greater mterest when they link phenomena at diflerent levels However, 
m sociological and social psychological publications that are based on 
empirical research a different kind of ihconamg has been common 
relating two or more variables at the same level of reality Of course, 
there has also been a tradition of cross level theorizing m (he social 
sciences, especially for topics with political implications Although 
these types of statements have a greater inherent relevance, they have 
usually been based on inadccjuate empirical measures This paper is a 
plea for combining the greater relevance of cross level theorizing with 
the greater rigor of so-called direct measurement The mam obstacles 
to such a combination (which has become technically feasible) arc (I) a 
lack of theoretical formulations (hat relate both lescis as interacting 
realities and (2) certain intellectual traditions 
Whenever social scientists have been their own recorders of sense 
observations (for example, via controlled observation or interviews), 
they have preferred to use data on individual behavior (such as mobjJil> 
aspirations) to account for highcr-wicr phenomena (such as rigidity of a 
status system) For soaal psychologists, and also for many sociologists. 

It has accordingly been usual to "cxplam*' higher-order phenomena by 
means of relative frequencies observed at a lower level of a social 
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system,' ether in terms of a partial explanation or with a reductionis 
intent ' In European political science more often highcr-order proper les 
(such as stability of political preferences ‘f 
preled as if they were expressions of individual behavior (such 
stability of an individual’s political preference) 

Implicit in such substitutions for direct measurement is, in many 
cases, the assumption that occurrences at one particular level have a 
“higher” degree of reality. Thus individual behavior may be treated as 
being the only real phenomenon, while system properties arc abstra^ 
tions, or individual behavior may be viewed as a mere reflection ® ^ ® 
only reality, namely structural properties — as some Marxisl-inspire 
writings assert Naturally, such orientations arc rarely defended as 
explicit theoretical positions And often the substitution for direc 
measurement is an accommodation to past necessities that turned into a 
tradition'* 

In cross'level explanations our chains of inference are often fright- 
fully long, m spite of the verbal respect that in empirical social science is 
paid to Robert Merton’s notion of “theories of the middle range.” We 
are so conditioned to this practice that we do not consider it outra- 
geously bold when direct equivalences are posited between presumed 
personality properties in a population (such as authoritarianism) and 


* A major example of the ‘ individualistic fallacy” is Gabriel Almond and Sidney 
Verba, The Civic Cuiiure (Princcion Princeton University Press, 1963) 

’ Most versions of a rcductionislic orientation arc related to the ontological con- 
ceptions of Auguste Comte, namely that highcr-ordcr phenomena arc just com- 
binations of lower order phenomena, arxordmgly. there should be a hierarchy of 
science where each preceding science in this hierarchy “explains” the phenomena of 
the subsequent via a reduction to Us own level of observation Thus, chemistry could 
be “reduced” to nuclear physics, psychology to neurophysiology Rcductionism m 
the social sciences usually lakes the form of “explaining’ sociological statements 
with reference to psychological variables and laws This is the explicit program of 
George C Homans, c g , m his Soaal Behavior in lis Clementary Forms (New York 
Harcourt, Brace & World 1961) In the form of a reduction of sociology to psychol- 
ogy, rcductionism seems mluitively plausible Nevertheless, reductionism as a 
general stance is logically untenable, cf Cmcst Nagel, The Structure of Science (New 
York Harcourt, Brace & World, 1961), especially chapters 1 1 and 14 

’ Cf the conception of "climate of opinion" in B Berclson, Paul F Lazarsfeld, 
anUW MaePhee, Ptu/n;? (Chicago University of Chicago Press, 1954) 

* The dearest example is probably the school of g^ographie electorate in France 
The logic of procedure is basically still the same as in Andre Siegfrieds Tableau 
politique <le la France de I Quest sous la Troijieme Repubhque (Pans Cohn, 1913) 
For a cniical evaluation see Austin Ranncy (ed ) Essays in the Behavioral Study of 
Fo/J/icj (Urbana University of Illinois Press. 1962), pp 91-102, sec also Erwin K 
&hcuch. “Cross-National Compansons Using Aggregate Data ' in Richard L 
Merritt and Stem Rokkan (eds ), Comparing Nations (New Haven Yale University 
Press, 1966), pp 131-168 
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the character of a polity = Perhaps the chains ofinrercTOs are so Ions 
and so tenuous because the data we work, wjth leave us so much freedom 
to organize findings in a variety of ivajs Especially when we work with 
data from only one level of observation, the dtta put few restrictions 
on the fertile mind of a social saentist—and this is naturally an 
appealing condition 

Today most of our empirical knowledge m mdustnahred countries 
IS based on surveys or other recordings of the verbal behavior of in- 
dividuals A check of a recent attempt to inventorize our social science 
knowledge confirmed this impression ‘ This check also showed how 
extensively we have neglected what Stein Rokkan termed “process- 
produced" data and have faded to take advantage of the fact that few 
things occur in developed societies that do not leave a trad of paper and 
numbers ’ Not only do social researchers prefer to collect their own 
data at the expense of other available information, but they are also 
inclined to be monomaniacal with respect to techniques of data 
collection 

This description characterizes the situation m Anglo Saxon countries. 
Scandinavian nations, and Central Europe In some other countries, 
notably the Latin societies and Eastern Europe, social scientists more 
often prefer to work with one type of process produced data ofRcial 
statistics This is especially true for political behavior In addition, m 
most of Europe election returns and census data arc the kind of data 
that the mass media and (he politicians accept most readily as “hard 
evidence " Since in all Continental countries the fate of social science 
as a branch of learning as well as (he fate of individual social setendsts 
is more influenced by the reaction of mass media and the intelligentsia 
m general than fay fellow social scientists, this veneration of ‘ official 


* Ihis type of reasoning «3s very popular JnMncdaicly foJJowmg World War JJ 
in attempts lo account for the development of totalitarian regimes in Germany and 
Japan Theodor Adomo, Bertram Schaffner and Dawd McClelland (for Gcnnany) 
and Ruth Benedict (for Japan) are authors whose eapfanations gamed a good deal of 
credence and even fame A popularized version of ihc reasoning is found in the 
wrilingt of Ctfo/frj'Ccvw- If the ohjtfrt of r«pIanation were not a nation but rather a 
factory, it would be much less plausible jf an author were to argue that the specific 
character of the Renault versus the Peugeot factorres and Iherr products was doc to a 
personality trait common lo ihe workm m one but not in the oitwr factory Di 
planations of political systems wiih reference to personality Irani of Iheir memben 
are plausible only because they lend lo rej^iase m an acceptable lernimofogy what is 

alrcadyknownby'''ayofnaiion3lstereotypcs f 

»Cr Bernard Dercison and Gary A Sterner H^n^tMaUcr- An ln^ef,rory of 
Scientific Findings (Hev/ York Harcourt Brace A. World, 1 9W) 

’ Compare Stem Rokkan fed ) Data Arehees far the Social Setenees (Tans 
Mouton, 1966) 
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data’ ts one important stimulus for basing cross-kscl analjscs on one 
tel (rggregrerdala, and this kind ofaudtence for the social sc, ent.st 
also prOTtdes a largely uncritical market for one level J „ 

The audience for social science research, and specifically 
relevant writings, accountsjusl as much for national 
use and m theorizing as any inherent need or any 
could possibly do Traditions arc undoubtedly important *0°* ^ 
present interest of Continental sociologists and 
aggregate data is still a consequence of the fact that m the nine 
century data from the official bookkeeping of bureaucraciw 
available before social scientists became their own data collectors. 


the United States the reverse tended to be true 
The most important type of data from ofTicial * social bookkeeping 
are so called ecological data, that is, data aggregated for territoria 
units In the United Stales the use of such data has become an object oi 
a much justified criticism ^ Insofar as these data arc employed as m 
dicators for individual behavior, they give rise to the * ecologica 
fallacy”*® If a direct measurement for lovscrdcvel units (such as in- 
dividual voters) IS available, the use of ecological data for statements 
about individual behavior is obviously cither poor research strategy 
or a regrettable adjustment to one s limited resources The same is m 
principle true for other kinds of grouped data such as information 
aggregated according to school classes army units, or hospitals 
In all these instances the chances of becoming the victim of an 
ecological fallacy, or of other versions of the grouped data fallacy. 


•The consequences of the societal and instiiulional settings in which soaa 
research is carried out and specifically the impact of diffcrenl kinds of audiences on 
the style of social science are analyz^ m Erwin K Scheuch Sozialforschung und 
sozialer Wandel ” Kolner Zeuschnft fur Sozlologtf und So,.ialpj}chohgie 17 (196*) 
pp 1-48 

’ For example sec Austin Ranney The Utility and Limitations of AggrcS^ic 
Data m the Study of Electoral Behavior in Ranney {ed ) op at , Roy G Francis 
OnlhcRelationsofDatatoThcory Rural Soaohg) 22(1957) pp 258-266 
'•See William S Robinson "Ecologcal Correlations and the Behavior of In 
dividuals American Sociologtea! Reuew /5(1950) pp 351-357, Herbert Menzei 
Comment American Sociological Rewiew 15 (1950) p 674 Erwin K Scheuch 
Cross-National Comparisons Using Aggregate Data m Merritt and Rokkan 
(eds) op ell pp 131-167 

“ Another example is the erroneous use of statistics m the controversy about 
whether Thalidomde caused malformations in newborns Intake of medicaments 
conuining Thalidomide occurred in terms of millions of units malformations m 

thousand of cases — a classical situation for committing ecological fallacies A 

fuller discussion will be found in Erwin K Scheuch Zur Methodik und Statistik 
^ Dysmehe-Forschung. IK»«wr Medizimsehe Woehemchrifi 117 (1967) PP 
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IS a runclion of the dcsree to which the marjinals of a contingency table 
also control the %vilbin-ccll variation “The dangcis are, Mem paribus 
especially great when one tries to explain from variations obser\ed on 
mgher-order units (like counties) the changes in the behavior of rare 
Jower-leve! units (like individual voters for small parties) 

Of course, Herbert Menzcl is right in emphasizing that ecological data 
(or other grouped data) are not always emplo>ed as a subsiiiulc for a 
more direct measurement on a lower level but are used as descriptors for 
groups or areas ’ ^ Many of the practical interests that applied research 
serves are of this type, as when politicians are interested primarily m 
numbers of votes regardless of who may have cast a particular ballot 
Also, a direct measurement may not in all cases be the most reliable or 
even the most valid indicator of behavior, as in some areas of consumer 
behavior There are, in addition, \arious techniques to obtain under 
some conditions an approximate expression of the size of the ecological 
fallacy Essentially, these techniques consist of cross*cutting sub- 
divisions of grouped material while observing the stability of the 
regressions across the different partitions of the material '* 

Thus there are instances where ecological or group indicators are 
cither the most direct or even the relatively most precise measure In 
general, however, we recommend as a rule m choosing indicators what 
covild be termed ‘ the principle of direct measurement” the measure- 
ment that requires (he shortest chain of inference is generally to be 
preferred This principle of direct measurement does not, however, 
imply a recommendation to avoid using ecological (or grouped) data, 
rather it points to specific usages of grouped data (hat have so far been 
neglected 


‘ * Cr Oils D Duncan and Beverly Davis An Alicmausc to Ecotogical Corrtla- 
tions.' Anttncan Sociological 18 (1953) pp 665-666 Leo A Goodman, 

'•Ecological Regressions and the Behavior of Irwlividuals. AmnUan Sachlogleal 
RevicM /d(t953), pp 663-664 Since these earlier puWicaiMrts Ihcre haa been JiliJe 
progress in discussing ihc deierminanis of ihc ecological fallacy, until ihc very 
imporiantccminbuiionbyHiy'AafdR Alfccr AT>pof<tgy<>rEcoIoeicatraJiacifs,* 
Ch3prcr4ofth»svo!umc SecalsoPaolF LaiarsfctdandN Henry The Application 
of Latent Siruciure Analysis lo QuaniilaKvc eoolos»ca) Data in Trcd MasMok 
and Plwlbum Raioosh teds A Moihemaiical Cxpl^ttons w B<fia\ioral Science 
(Homcwood.IH Irwin Press, 1965) 

'*Cf Mcnre) op cil p 674 Therers ofeourse nothing inferior about ecological 
or other grouped data w hat is at issue when working with ecological ^la is In each 
case Ihe relation of the crnerion according to which units are grouped, to the types 
of mfcrcnccs intended when using l)w resutlsofaegr^jaied uniti. 

'* Discussions of techniques lo estimate and contro! ihc ecological fallacy »«! ck 
found in Duncan and Davis, op «/ .LazarsfeWandHenry.inMassarikanO Raioosh 
(eds).o/» ei/ . and A)kcr, op cit 
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DANCERS OF THE INDIVWVALISTIC FALLACY 
The ecological fallacy (or group fallacy) has come under suir.aent 
scrutiny that we can now consider most of the problems rat 
tellectually settled, even though in practical research these mor 
tinue neatly unabated The reverse fallacy, to infer incorrect y 
observations on lower-level units the condition of higher-order sy . 
IS rarely treated in methodological literature This type of fallacy na 
been termed the “individualistic fallacy - Mn many cases there will not 
even be a suspicion of a fallacious reasoning, probably because o 
greater control over the individual case and the greater sense o rca i 


when dealing with individual units . 

The criticism that Hegelians of all known varieties, and m genera 

European culture critics, level against survey research expresses a ec mg 

of uneasiness, couched in philosophical terms, about an often rca 

misuse of empirical evidence Itisarguedandwjdclybelicvedamongt c 

European elite that soaal research (which incorrectly is equated wit 
survey research) just registers man's subjective rellcction of or unwittmg 
adjustment to reality instead of catching the real determinants of 
havior or the character of social systems Social research is accused o 
reinforcing an already false consciousness by publishing facts about 
mere epiphcnomcna In itself this argument is nothing but a con- 
temporary version of Feuerbach’s notion of Vercloppelung des Scheins 
As an objection m principle against social research, this reduces ilseli 
either to some sort of theology inspired by a revelation about the true 
nature of things, or to solipsism, as a corrective for a prevailing hack 
use of survey research and other kinds of individual measurements this 
IS a very useful critique 

In the course of another project we inspected a large number of 
publications based on survey research We found that m terms of the 
uses made of individual questions — that is, in their function as in- 
dicators— they could be classified as follows 


a Ego s responses are the observable phenomenon itself This is true 
for the usual opinion questions in surveys, for many psychological tests, 
or for scales that use statements Provided we abstract from such 
disturbing factors as response scis.*'^ the problem of the permissible 
inferences is shifted largely from the phase of using data to the act of 


||Cf Scheuch inNlcmil and Rokkan (eds ) op at pp 158-164 , 

^ GcselUchaff’m^KWjrf/ 


.iiiv.cumanQRoititantecls) op at p 

Adorno “Dct BegrifTder GcselUchaft ^ m 
uchfsSiaatsUxikon index Mord C«WZjcAo/r (StuUgart Kreuz Verlag. 1966) 
There exim a vast incraiure nn , .< .< Lee 


^ vast liicraiurc on response scU The locus ctassicus is Lee J 
t-ronbach. Response Sets and Test Validity" Educational and Psychohgfcal 
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construclmg the instrurarnts Typreally ihj uml of mcasiiremcnl and 
the unit of anal^'iis are lodentical 
b Ego's responses are ireateU as selMbservations This tends to be a 
characteristic of what are called behavior questions in surveys What 
kinds of inferences are legitimate is determined mitnly by the degree of 
congruence between responses and actual behavior Again, the unit of 
measurement and the unit of analysis are usually identical 
c Ego's responses are treated as an observation of one's environ- 
ment Examples are such frequently used questions as “What does your 
family ’ or “How often do people here ” In these cases the unit 
of analysis is often the behavior of persons or group outside the dyadic 
relationship between researcher and research subject 

tl Ego’s responses are treated — albeit very often unwittingly — as 
expert judgments Examples are “What do you think is most important 
to get ahead in your company*^ or Do teen jgers in your neighbor- 
hood cause more problems than they did five years ago'’ ’ — provided 
these questions are used as indicators for determinants m upward 
mobilityorasevidenceforchanges m the behavior of teen agers 

Like all classification systems that the human mmd imposes upon 
reality, this one too simplifies the actual complexities m using verba! 
behavior in social research Behavior questions such as ‘ How often do 
you listen to the radio'’ ’ may be used as indicators for attitudes toward 
the mass media, and opinion questions such as ‘ Do you believe most 
people here work as hard as they can'’ ’ may be employed as indexes 
of Iwhavior Secondary analysis has caused us to become more con- 
scious of the autonomy of uses that we have already practiced if we 
compare the actual use of indicators to ihcir manifest meaning Im- 
portant in this context, however, is only the relation that the type c and d 
uses of questions have to the manifest meaning of indicators 
It IS the type c and d of indicators that comes in for the kind of 
criticism mentioned earlier We shall mention but two examples <l) 
speaking with a!) members of a family would give us better data for 
processes that concern the lamily as a univ tdan rdf usexi’ s'odMWMvVc 


Afeasurfmettl 6(1946) rP 475-JM A ^ 

found in Lee J Cronbach, Esimhls of rsvthohsfc^ Tnung (New t|^0 
1962) A taxonomy of indicator uses in tbeir relaiHjn to l^c J' 
given .n Crw.n K Schcuch Oat rnien«w ,0 
ewp>mchei> So'mlfonrhan^ / iSimtgart Cnle Wbg inded 
'• Daniel Lemer has made extensive use of 
aiuiudcs Sec especially his book TU ftmine of TfodttuxiHtl (Ckncoe 

Free Press. 1958) Chapier 3 
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measure, namely the report of just ottc family member,' and (2) com- 
piling data from groups of workers in the same shop about tb="- 
ployment history would be a more reliable basis for analyzing factors 
in promotion than the opinions of isolated respondents Yet in o 
instances interviews with individual respondents arc vast y 
economical than the “best measure,” provided this is technica y 
feasible 

We have now become alerted to the problems accompanying the u 
of ecological data, but we still need to realize that the social scientist 
using survey research or other techniques of individual measurement is 
often dealing with indirect measures, too Calling attention to the prin 
ciple of direct measurement works both ways — not merely as a caveat 
against an uncritical use of grouped data but equally as a plea to realize 
that individual data can be indirect measures as well 
The problems associated with type d indicators arc compounded if 
individual responses are combined to represent measures of higher-level 
properties “Do most people who work hard and have a bit of luck get 
ahead?” is already a difficuU enough indicator to interpret for one 
individual, the national averages of those replying in the alTirmative is 
then certainly a dangerous measure to express differences in chances for 
upward mobility in various countries Incidentally, using the average 
responses to such a question as indicating the differences in mobility 
between the United Stales and France happens not only to be dangerous 
m principle but to be empirically wrong as well 

Dan Cahalan and hts Audience Research Bureau have shown to what extent 
husbands and wives disagree in describing iheir consumer behavior By and large, 
reports dilTered most when they required from ihe respondent that he should 
manre a large number of behavioral acts, as in a question like ‘ Who has changed 
most m ideas and altitudes since you were married’’ ’ By implication a question like 
this really asks the respondent to perform as an expert witness While Cahalan's 
remained largely unpublished because of commercial reasons, results of the 
dilTercncc reveal technique — interviews in which husbands and wives were con 
fronted with ihcir conflicting statements — were more widely published (Fred 
Strodtbcck, Uric Dronfenbrenner) Ernest Burgess interviewed husbands and wises 
simultaneously but separately over a number of years and our as yet unpublished 
secondary analysis of data showed considerable disagreements between the spouses 
even about such hard facts as length of betrothal 

In an internationally comparative study Alex Inkcles found an especially low 
percentage of belief in chances for upward mobility in France and a prevailing belief 
riS> “"I* Z.eg=l showed in David 

A Ronlledge 1954) actual mobiliiy 

Chances for interecnerational mobility from manual to 
nonmanual were even a bn belter in France than in the United Stales The con- 
holJever" imlrrunr'"''"* “"<• chances m two politics provides, 

obll^t'vecSnlexIs '‘'“'"''"'S subjeclive arrangements wilhm 
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Almonil and Verba ajVcd crosv jccimnv rif the populations ofvanous 
wlions about ttlul one ii proud of m one's nv^n country, and iJjc 
bcl«ccn-nanon difTcrences m re^^an^es were u«cl as expressing stability 
nr ^liljcal tnjtiJutions. certainly such answers arc mainly reflections of 
coJIeiiivc properties in the mmds of respondents In the same inter- 
nalionally coinparatnc smd). cross sections of respondents were asked 
to name what ihej considered to he successful forms of protests against 
bureaucratic measures, again, these aggregated opinions arc most 
certain!) not the most direct measure of citfrens’ ability to influence 
authorities Aggregating indisulaul responses as a measure of the 
climate" of opinion in a communit) is equally problematic; while 
there « reason to belicse that something like the "breakage clTecr' of 
Lazarsfcld and Ilercbon docs exist as the property of a district, an 
opgfegation from the resps’nscs of a smalt sample of individuals is 
hardly Its best measure ** 

In ordinary survey research the individual is usually treated — albeit 
by implication ~ as an atom Merely adding atoms of course does not 
make for molecules To continue with the same metaphor, just how one 
represents "socul" molecules if the interrelation of atoms as part of a 
molecule is svhat one is interested in happens to be largcl) unexplored 
It IS this kind of interrelation between the character of individual units 
and the character of higher-order units that the group of social scientists 
in the Hureau of Applied Social Research of Columbia University has 
studied. 


CONTEXTUAL AND RELATIONAL ANALYSIS 

Uzarsfeld and his collaborators have attempted to develop pro- 
cedures that provide direct expressions for some kinds of interaction 
between properties of individuals and properties of groups and m- 


*' Cr AtmondanJ Vcft>a.«’/T «r . „ ^ , , 

”The procwJorM «o which *c refer are described in Bernard Bcrelson « af. 
ItuliwfChicago t/n.iw<iyofCb«o»8oI’rcss.l9^) «mpa« also J^ul F Uan- 
fcIJe/<r/ 7 >rF«’Wfr C/ro/retrtew' Vml DucH. 2 nd ed , 1949) A rnore refined 
aiicmrt to accouni (or ibcconuway of votms m local corr.mumti« is d«crit«d m 
David H SegaUnd Marshall VV Meyer. *7^SoC3l &ni«l ofFoIilical ParliMn- 
vhm '• Chapw 9 of ih» volume The m«l dilTercrrtiaicd auempi to retaic mdividual 
and renlctmal rrorertics is Ham O Khagenann. BtsUmmungsgrundt dtr f\ohUM- 
scMdung Im IhirHUswgx^'ahlKrtis iMhrmn £btf rtgiomite Hahlanahse (Afeiscn- 
heima C.. Anion Ham. 1W8) 
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strtutions“ StoufTer had observed that soldiers attitudes toward 
promotion can best be explained by relating them to relative opportunl 
ties for actual promotion within given military units For an approacn 
that in this way relates individual attitudes and properties of a group 
Lazarsfeld coined the term “contextual analysis Contextual analysis 
IS then the description of a member of a collectivity by using properties 
of the collectivity . 

Soon It became apparent that in relating units to collectivities to 
finite sets) indicators of different types were used, and this in turn re- 
flected the differences in the type of explanations for the unit as part o a 
collectivity Closest to the usual orientation is the so-called relationa 
analysis,” which characterizes an individual relative to other individua^ 
units of the same set of persons Describing a pupil’s grade as B 
and also as ‘‘second quartile” is an example, the first grade denotes an 
ability as an absolute property, that is, independent from the specific 
class to which he belongs, while the second attribute expresses per- 
formance relative to the distribution of abilities in a specific set In this 
way, the individual is accounted for as member of a group, but the group 
IS treated only as an aggregate of specific individuals 
Indeed, for many aspects of behavior this representation of an in* 
dividual’s environment is appropriate, but not for all aspects Thus, a 
chance for upward occupational mobility is a consequence not just of an 
individual s relative ability but also of changes in the mix of jobs m the 
economy at large The latter type of attribute of a set is called a “global 


** Cf Paul r Lazarsfeld and Herbert Menzel “On the Relation between In 
dividual and Collective Properties ' m Atnatai Elzioni (ed ), Complex Orgamzatior^ 
(New York Holt, Rinehart, & Winston, 1961) pp 422-440, see also James S 
Coleman Relational Analysis’ m the same volume Further discussion will be 
found in Hanan C ScUin and Warren O Hagerstrom, “The Empirical Classificali^ 
of Formal Croups * American Sociological Fevtew 28(1963) pp 399-41 1 . Ernest Q 
Gimpbcll and C Norman Alexander, ‘ Structural EfTects and Interpersonal Rc* 
lalionships ’ American Journal ofSociohg}, 71 (1965), pp 284-289, Leo Mellzcr, 

• Comparing Relationships of Individual and Average Variables lo Individual 
Responses.’ American Sociological Revien 28 0963) pp 117-122, Erwin 
Scheuch. ’’Eniwicklungsrichlungen bci der Analyse sozialwisscnschafllicher Daten 
in Kdnig (cd ), op ell pp 655-685, especially pp 670-677 
I contextual properties in empirical research that influenced 

^Mrsfeld smeihodologicaloricntaiionare Samuel A StoufTcre/ii/ Vie American 
i^Wirr /(Princeton Princeton University Press, J949) pp 256ff.pfljj//n, Seymour 

M Lit^ter c/ Union Demoerac} {Gkncoc Free Press. 1956) especially Appendix 

1, Herbert and Elihu Katz. Social Relations and Innovation irt the Medical 

* Quarterly 19 (1966) pp 337-352 

Cf Ja^ S Coleman Relational Analysis — the Study of Social Organisa- 
tions with Survey Methods.’ Human Organization 77(1958/59). pp 28-36, and also 
in Ltzioni (cd ). op clt 
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property,’* that is, a property that exists for all umts independent of the 
composition of the group Tlie term “contextual anal>sis’’ is now used 
primarily for an analysis that relates global properties to individual 
attributes Examples of this type of analysis arc as jet QUJte rare, and 
for ecological data as a form of global properties there are exceedingly 
few examples 

So far, contextual analysts and relational anab sis are largely attempts 
to overcome some of the limitations of ordinary cross sectional sur\eys 
This methodological approach proceeds from a more realistic assess- 
ment of an individual’s cross-pressures rather than from reflections 
about the striictureof collectivities However, we do note the beginning 

of a realization that in many of our observations on the level of in- 
dividual actors we do not really take seriously the fact that there is some- 
thing like social structure Hence there has been the tendency to 
proceed immediately from observations on the level of individuals to 
statements about the character of bigher-ordcr units, and hence the in- 
clination to equate observations of individual attributes with the state 
of the polity This tendency is, however, due not merely to a particular 
blindness of empirical researchers but also to a gap in theorizing Good 
substantive theory would need to specify the intervening processes and 
structures, and if this were done more explicitly, it would also be easier 
to develop appropriate measures 

At this point in the Ime of arguments it is possible to relate the dis- 
cussion of research methodology to the initial sialement that explana- 
tions are more interesting when they connect phenomena on dilTcrcnt 
levels Together with the principle of direct measurement this con- 
nection has implications for research design and for theorizing 

1 A research design becomes more powerful when it provides for 
direct measurements of phenomena on different levels 

2 Theories in the social sciences that maintain a dependency between 
the state of a higher-order collectivity and the characteristics of the 
units within the collectivity should specify the processes of interchange 
between system AivciV 

According to this perspective, the controversy between advocates of 


Most of the toelhodological discujswn and the majonty of the mponant ex- 
amples m empirical work are accounis of the 

Voting bchaMor appears to be especially suited lopennii Jhede\eJopnwnt of models 

since ihc number of alternatives IS limited 

Family Soclofuation and /nteracdait fiwftr (Gfcocoe Frw Press 19^ and m 
Ecemom) (London Roultcdge4.Keg3nPauM957} espeaaHyOtajHefll 
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ecological measures and those preferring indnidual measurement be- 
comes a spurious issue If it is easier and often less rishy to infer 
structural properties from survey data than to derive statements about 
individual behavior from ecological data, this is due primarily to prac- 
tical difficulties rather than a matter of principle =' It so happens that 
ecological data as a main form of “process-produced data” arc organize 
according to administrative concerns rather than research interests 
And because of this particular difficulty in working with ecological data, 
grouped data in general have come under unnecessary criticism 

From this analysis one may conclude that cross-level measurements 
will become more common within the near future The mcthodologica 
discussion will help to overcome some of the reservations that exist 
tow ard the use of one or the other kind of data, and data banks as a new 
type of facility in the social sciences will increasingly store several kinds 
of data m an easily accessible formal ** The mam handicap is probabl) 
still the lack of explicit conceptual orientations for relating data from 
different levels 

In the remainder of this contribution we shall suggest such a con- 
ceptualization for relating individual properties to data for collectiv ilies 
and shall then give some examples of the usefulness of combined cross 
level measurements and cross-level theories Our particular notion has 
been influenced by Max Weber’s conceptualization of social action 
isoziales Handeln) and to some degree by George H Mead Of course, 
this is just one attempt among other possible approaches It is made m 
the expectation that at this particular point in the development of 
empirical research new conceptualizations and examples of successful 
usage will stimulate methodological discussion, which, left to itself, 
remains at present preoccupied with details of the research process 


THE USES OF OPTIONS ANALYSIS 

The higher level of any iwo-Icvcl measurement or explanation can be 
conceptualized as the “environment” of the lower level units Let us 
assume that the higher order unit combines lower level units that have 
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some heterogeneity or that the higher-ordcr unit is homogeneous merely 
with respect to the character»stic(s) that define(s) its boundaries We 
may now postulate that this environment vi ill be reacted to differentially 
by lov.er-leveI units that have difierent locations relative to the points 
in the distribution of characteristics of the higher-order unit The 
different locations of lower-level units in the set that makes up the 
higher-level unit can thus be represented by distinguishing several 
subsets If we denote every lower level unit by a combination ofseverai 
characteristics, the subsets have to be conceptualized as constituting 
relatively homogeneous but partially overlapping “selective" environ- 
ments for the lower-level units In our particular approach, which we 
term “options analysis," a subset can be thought of as a selective cn viron- 
ment for a group of lower-level units m the sense that for each particular 
subset the higher-level unit is a combination of characteristics with 
different values 

Let us interpret lower level units to be persons and higher-order units 
to be some collectivity with which they are in immediate interchange, 
such as the community For any given group of actors, defined by a 
common position, the community can be viewed as a “raw material ’’ 
In other words, for every group the community is a different constella- 
tion of “objective" characteristics Theconstellationofobjecfivecharac- 
teristics that is unique to a group of persons with a given position should 
be understood as the “subjective environment of that group ” Con- 
versely, the combination of values m the higher-ordcr set j i that forms 
the subjective environment for a group with the characteristics v, 
should be predictable from its position within the larger content j , 

For a graphic representation let us first assume that the biglicr-ordcr 
units j, „ are factories The left-right dimension represents the dis- 
tance of departments or working groups (or any other smallest sub- 
division of the personnel) from the act of selling the product, while the 
up-down (north-south) dimension represents status within the company- 
The subdivisions of j. represent the departments and the 

individual workers are represented by subscripts m arabic numerals 
.Tjd ^ (See Figure 1 ) 

The mathematics of this notion has not been worked out. but there is 
no doubt that it can be formaliwd m some sort of topological language 
Apart from this, however, some implications for analysis across levels 
of observation should be fairly obvious 
In research about motivations to wort, the level of the factory as a 
unit m w hich w or k is performed »s obv lously important andneeds to be 
represented via direct measurement of its properties This eraphic 
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Figure I 

Graphic Presentation of the RELAUONsinp Between 1-ower- and 
Higher Level Units 


representation shows that m some respects a breakdown of workers* 
responses (or other attributes) by factories will not be the most efficient 
strategy, what a factory means as a constraint on behavior or as an 
economic unit differs according to the location of the employees withm 
the factory Accordingly, the properties of the higher-level-unit factory 
should be assigned with differenl combmalions of weights to the 
departments and to the individuals within the department — depending 
on their loeation in a two dimensional space In the same manner one 
can represent the subjective (or selective) meaning of a geographic unit 
or a social organization o s 


slatcraenis it follows that the greater the homo- 
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variables for winch the same type of higher unit is the level of integration 
(or has an influence w its own right on the behavior of Iower*order 
units), the more we are justified in treating its cfTccts as “structural “ 
The notion of looking at a higher-order unit as an objective environ- 
ment for lower-lcsel units is, cetens paribus, most appropriate for such 
structural units 

From this observation it follows that the notion of options analysis 

a technology and a perspective of treating higher-level units as an 
envirofiment — is fruitful only for specific types of units In addition, 
a technical prerequisite for it is a “muttiplc measurement," preferably a 
combination of individual and global properties Provided these con- 
ditions are met, it is necessary to compute “clusters" of lower-level 
units or subsets These subsets are then related to their higher-order unit 
by denoting (according to their relative position) the subsets with dif- 
ferent values for the higher-order unit (see Figure J) An alternative 
procedure, and one that is closer to the original notion, is the computa- 
tion of the difTcring degree of determination of clusters of lower-level 
units from characteristics of the highcr-order units Still another variant 
of this analysts emphasizes the differences in explanatory power for 
configurations of attributes of the higher-order unit, this is the most 
direct expression of a "subjective environment " 

We shall give but one technically simple example We may observe 
a general correlation between the level of economic development by 
areas, on the one hand, and family composition and family stability, on 
the other Provided we observe that with a very high level of economic 
development several forms of the family coexist, we would interpret this 
as a reduction of constraints on the family which permit a greater 
number of options The “cost* of these options would then be deter- 
mined by analyzing indexes of family stability for families with different 
member compositions 

The notion of a subjective use of an objective environment had been 
first invoked ex post facto to help solve some problems m a cross level 
analysis The substantive problem was then to account for the effects of 
some kinds of communities on voting behavior some commumPes 
appeared to have an influence on voting in addition to the usual in- 
dividual properties, while others did not We then remembered 
Uexkuirs distinction between an objective Umgebung and an Umuelt 
which differed according to the needs of the various species This notion 
from biological ecology seemed one possible way to explain the varying 
strengths of relationship between system levels In addition, this dichot- 
omy also appeared related to one pcrspcaive in sociology that was 
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expressed by Max Weber as the task of sozmies Handeh deulend zu 

'^Subsequently, this approach guided a number of empirical studies, 
and It became possible to include appropriate features in the design 
Among those v.cre an insestigation of differences in primary group 
influence on \oting behavior and a study of returnees from Germany o 
their nati\e Greece and Turkey The latter may be helpful m indicting 
some of the consequences for research design The problem formulation 
Itself was one calling for cross level analysis in that the interaction 
betw een lev els of social reality was the major independent factor for t c 
phenomenon of readjustment The character of the native environment 
could be expected to have different consequences depending on the 
attributes of the individual returnee However, merely comparing t e 


influence of large contexts such as the nationstates of Turkey or 
Greece would be too gross a description of environmental effects, for 
the individual returnee Turkey and Greece were, in part, abstractions 
that exert influences only through several mediating institutions There- 
fore, extensive use was made of descriptors for such immediate en- 
vironments as the larger famil), the peer group, and the work 
setting While the factoiy can be understood as one setting in which 
Turkey or Greece as economies become concrete for the individual 
returnee, the factory in turn is an individual unit influenced by the 
economic organization of the nation state In this way, the influence of 
the nation state is filtered through many intermediary steps down to the 
individual The fact of ‘ filtering” phenomena in turn gives opportuni- 
ties to the individual returnee to choose work settings of different 
character where he experiences the effects of an abstraction such as 
national economy in different ways 


'SUBJECTIVE EfiVlROUMEtiT' IN STUDIES OF 
VOTING BEHAVIOR 

The notion of “subjective environment” and the formalization of this 
notion were first used in an analysis of election data to account for the 
stability of voting in some communities of Southwest Germany This 
stability IS in itself a well-known phenomenon, but in attempts to gi'® 
an explanation we came across some other cunous regularities In small 
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communities with a high stabijit> of election returns there were also 
sometimes abrupt switches in voting and the new patkrn then tended to 
exhibit considerable stability A breakdown of community size by 
degree of political homogeneity revealed a monotonic but strongly 
curvilinear distribution somewhat related to a hyperbolic curve Most 
puzzling at first were the strong contrasts in voting behavior between 
neighboring communities for which there were no obvious explanations 
by community characteristics All these obscriations confirmed that 
communities as such someiimes had a sironc influence on voting be- 
havior, m addition to the usual individual characteristics 
Of course, there were labels to attach to this observaiion, such as 
“climate of opinion and breakage cfii“ct which Lazarsfeld and 
Berelson had coined However, these terms are merely descriptive and 
not explanatory The one explanation sometimes given m this connec- 
tion, that voting patterns are determined by the principle of inertia, 
name!) that once established they will continue would not be specific 
for a collective such as a community also it agreed with only some of 
the acts. Fortumtely, the design of our main empirical study, the 1961 
election study, permitted further analyses 

The BunJcsiagsv>afifsuidie 1961 combined three national sample 
surveys at three points m time with constituency panels that were inter 
Viewed m four waves, special attention was paid to interpersonal 
processes The interview data were then supplemented b) a program of 
systematic field observations by content analyses of newspapers and 
television coverage and by a collection of ecological data for communi 
ties and within larger communities for census tracts Thus, vanousforms 
of “direct measurement for phenomena at di/Tercnt levels of the polit 
ical process were available Here only the cases of the constituencies of 
Heilbfonn and Munchen Land will be reported — two units with quite 
different objective charactemlics and with contrasting political 
histones 

The varied forms of measurement were essential for the later attempt 
to give the notion of a ‘breaka^ effect m part a real explanatory 
meaning When working with voting statistics for different types of 
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subdivisions or the Ini-ge constituencies (more than ^s<m*)j 

,t could be observed that some of the strongest differences 
ecological units occurred when these units were adjacent ’ 

regularity seemed to hold only for rather small communities (delimic y 
below about 500 inhabitants, and sometimes up to 1,000) and it most y 
disappeared for larger towns and cities In the more populated gc^ 
graphical units voting behavior varied in the expected way wit 
relative frequency of demographic characteristics, and it was a so o 


usual heterogeneity , 

To account for these observations it was postulated that soci 
characteristics that are perceived by the actors as antagonistic express 
themselves also as differences in voting behavior This statement imp les 
that one mam orientation in casting a ballot is the confirmation of group 
“belongingness,* in an earlier analysis it had already been 
strated that the voters treat parties largely as reference groups ^ This 
generalization of social differences into equivalent difTcrences in voting 
behavior was presumed to be strongest for social charactcnstics o 
general relevance (operational definition generality of relevance equals 
the number of situations for which quasi groupcharactenstics determine 
behavior) Bclongingtoacommunitycanbe shown to mean for actors a 
quasi group characteristic such as age or occupation Let us now 
introduce the following as an additional proposition the smaller the 
community and the more homogeneous the environment, the stronger is 
the effect of belonging to a community as a quasi group membership 
Therefore, in small and relatively homogenous communities voting 
should have the function of affirming community distinctiveness, m 
larger communities the equivalent function should be served by voting 
according to membership in antagonistic quasi groups This mechanism 
leads to stability of voting behavior over time and manifests itself m 
phenomena called the “climate of opinion” in a community 

To test this line of reasoning as explaining differences in voting be 
havior between geographical and social subdivisions, we determined m 
the first step those communities that displayed a distinctive voting 
pattern (operational definition, clear majorities for any one party) 
In the second step we selected communities that were unusually 
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homogeneous lAith respect to the following characteristics (1) occupa- 
tional composition, (2) religious denommation The communities that 
had been identified in this way wre then plotted on a map, this 
made it easier to identify proximity as one major variable m our 
theory of “habitually antagonistic voting behavior " 

The following notion could now be empirically tested Distinctiveness 
of voting behavior is a function of physical proximity of communities 
v-ith opposing sets of characlcnstics TTie greater the homogeneity 
(withm-class differences) of adjacent communities with differing char- 
acteristics, (he greater the chance that these communities would show 
mirror images of voting results Physical proximity of communities can 
be considered as a rough index of social contacts between these higher- 
order units From studies of social class we knew that antagomsm was 
greatest between those groups on a scale of social prestige which were 
cither adjacent or at (he two extremes of the total distribution , the latter 
were safe hate objects, while adjacent groups just tended to “rub" each 
other more m daily contacts The analogous phenomenon m voting 
behavior by ecological units would be intensity of social contacts as 
transforming difference into antagomsm This effect would sometimes 
be strong enough to reverse the correlations between group membership 
and voting behavior that are usual m the society at large For most com- 
munities or ecological units below 500 inhabitants, and for many be- 
tween 500 and 800 inhabitants, this conceptualization could account for 
a large number of deviant election results if we understand by deviant 
a change from the normal relation between group membership and 
voting 

The notion of "climate of opinion” can now be restated as an ex- 
planation of phenomena instead of merely as a label Thus the "climate 
of opinion” IS the generalization of a distinct pattern (or configuration) 
of historical and/or social characteristics of a territorial umt which 
include voting behavior, together %vith the continued assertion of this 
distinctness in voting patterns If one substitutes for the term “terntonal 
units” the term "groups, ’ the preceding statement becomes a general 
e.%f 0 j 3 J 3 sJJC! 3 £if “deviant" votmiu that ts, of voting behavior that does not 
follow in the usual way from a combination of group characteristics 
In a short form the phenomenon of "climate of opinion” can be re- 
stated as an “assertion of a generally deviant or marginal position of a 
group vis-a-vis Its neighbors by means of voting behavior " 


Erwin K Scheuch (with flans J Daheini) ‘SooaJprwiigc and w«^eSch)Cb 
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issufi’no S.pp 65-103, especiaUypp 



152 


ERWIN K SCHEUCH 


An essential condition for this mechanism to operate is a h'Shde^cc 

of internal homogeneity, with a relatively (that is, in ° „ 

group heterogeneity) strong difference from other groups The ewct n 
strongest if there are both internal homogeneity for several charK 
terislics at once and low spatial and/or social distance from “‘'’“'•SrOTP' 

For groups with deviant social characteristics this condition is o 
most often in classical minonly group situations, while for tern or 
units It tends to occur only when such areas arc rather smal n 
way, the often obscr\ed longterm traditions of voting be avior i 
minority groups, the stability of voting in some terntonal units, an 
even the continuity of election results m societies with strong socia 
cleavages can be explained by the same mechanism 
The conditions that have just been described imply that for persons 
within these groups or territorial units, subjective and objective envi 
ronroent would coincide more than can be expected According}* 
for larger communities and for heterogeneous territorial units a 
different kind of cross-level explanation appeared indicated, as well as a 
difiercnl technology in tracing the interaction between levels If u'* 
deed subjective and objective environment dilTer from each other, 
dependent measures for the phenomena at different levels are required 
To increase the contrast with the conditions that we previously 
examined, we selected m the next step of this analysis the largest cities 
of several constituencies Special attention was given to the cities o 
Hcilbronn (in Southwest Germany) and Freismg (in Bavana near 
Munich) A breakdown of election results into precincts again showed 
stronger difrcrcnccs in voting behavior than could be expected from the 
differences m social composition Again we found m a number o 
precincts that an unusual homogeneity of voting behavior continued 
over longpcriods, even though the original population had in part been 
replaced by new inhabitants These stable political units had remained 
stable with respect to one social characteristic as well, in spite of the 
exchanges of population with other areas namely continuation of a 
population with an extreme social status position For the population 
m those areas, the city as a unit had a limited relevance, and voters 
tended to assert ihcir marginal positions by means of voting behavior 
just like the population in “marginal ’ small communities In general* 
however, survejs showed the expected pattern of a moderate association 
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between group characteristics and clcctoraJ behavior, with the various 
territorial subdivisions having litiJc predictive value 
An analysis of voting results for Freising demonstrated surprising 
irregularities of election results m various areas Ecological data did not 
lead to any explanation, but survey data did Freising is within com- 
muting distance of Munich as the economic center of the whole region 
It was natural, therefore, to analyze the effects of commuting or non- 
commuting on voting behavior holding membership in quasi groups 
(like occupation or age) constant In many social groups commuters 
voted as their counterparts in the metropolitan city voted, this is 
especially true for skilled workers In other cases (he commuters 
adopted the voting pattern of their bedroom community Freismg: this 
IS sometimes the case with white-collar workers Which territorial unit 
had an influence on voting (or rather can be treated as a reference area 
in accounting for election results) varies by social group — a variation 
that can be explained only by using survey data 
Still another group of data needs to be considered before a summariz- 
ing statement for the interrelation between higher-order units and in- 
dividual characteristics can be attempted data for intervening in- 
stitutions This can be demonstrated by the failure of cither aggregated 
individual observations or ecological data to account for two phenom- 
ena* (I) variations in (he success of the new radical party, the NPD 
and (2) dramatic shifts m voting that occur m small cornmunmes 
In communities of less than 800 inhabitants long periods of stable 
voting behavior may be interrupted by sudden reversals of political 
preferences, resulting in a new stable voting pattern, or by equally 
sudden and then persisting splits within a unit hitherto characterized by 
high political homogeneity A number of such dramatic changes could 
be observed in small communities of the Saarland A more detailed 
observation of these communities revealed that straight reversals of 
established political preferences in the electorate were usually accom- 
panied by switches in the parly allegiance of local leaders Two con- 
ditions had to coincide m order to produce the phenomenon of re- 
1 ersaJs * small sure ofcommunity plus relatively great homogeneity and a 
change in the party preference of leaders Surveys confirmed (hat in 
these very small communities participation in politics i$ not behavior 
funciionally differentiated from other behavior Under such conditions 
voting for an opposition can be looked upon as a form of deviant bc- 
hav lor This particular deviance « meant to be made cas> by the secret 
ballot, but surveys again showed that voting is not thought to be com- 
pletely secret (villagers make quite accurate guesses about who votes 
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against the majority), and even that rather painless deviance still re 
quires a little extra effort on the part of the deviating person Unoei 
those conditions a survival of an older and general pattern in om 
geneous local units occurs the ability of leaders — typically t ey ar 
functionally unspecific leaders — to count on the general loyalty ot their 
local groups 

The sudden splitting of votes in small communities amounts lo 
breakdown of this order and occurs usually either when there is a sp it 
a previously monolithic leadership or when the leadership allies wit a 
party that to the villagers represents an “outside" world hostile to them 
or when a new functionally specific leadership for politics moves in o 
the community Again we can rely on surveys to confirm what an 
analysis of ecological descriptors leads us to suspect, namely that even 
in small communities the social heterogeneity is often greater than t e 
heterogeneity of voting behavior This general heterogeneity does not 
become political pluralism, however, unless there is some support via 
intermediary institutions or persons According to survey figures and 
ecological data, social heterogeneity is translated into political pluralism 
if the deviant political behavior receives outside support, and unionized 
commuters are often the first group to import the prevailing political 
culture at their place of work into their villages This observation can be 
generalized* Whenever a person is moving in more than one environ- 
ment, he is likely to vote independently of the local majority, and wheit- 
ever in this or any other way a threshold of “exemplary heterogeneity 
m voting IS reached, the previously controlled social cleavages tend 
suddenly to express themselves fully in political pluralism 
The surprising success of the new radical party of the right, the 
Nationaldemokratische Partei Deutschlands (NPD), even m the 1965 
federal elections and subsecjuently in Lander elections, affords us an 
opportunity to demonstrate the need for a joint use of separate in- 
dicators for several aspects of reality A first analysis of ecological data 
had led most commentators m Germany to conclude that the sudden 
increase in votes for the NPD was due largely to the refugees Indeed, 
there were quite strong correlations between the high pefcentage of 
refugees m a constituency and the successes of the NPD However, 
survey figures gase only an extremely weak correlation between refugee 
status and the tendency to vote for the NPD A more complete 
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analysis of conslitucncj characjeristiw revealed that this contradiction 
bcl\%cen survey and ecological analyses >Aas not jusl another instance of 
the ecological fallacy 

The percentage of refugees ina constituency (or territorial subdivision) 
had a real influence on the tendency of individuals to vote for the NPD 
but in a more indirect way than had been assumed m the first cxplana 
tions Refugees had been in general much less extreme m their political 
views than the pronouncements of refugee organizations — a usual 
phenomenon if one considers refugee orgimzations as pressure groups 
with a weak mass basis yet deriving their cficctivcness from claiming 
masssupport The NPD on the other hand had to rclyforpropaganda 
on word of mouth contacts plus old fashioned tools of political pro 
paganda (such as leaflets and local rallies) since it had little money and 
no real access to the mass media In view of thescconditions the correct 
causal chain w as [a) the higher the percentage of refugees in a territonal 
unit the stronger were organizations of political extremists (b) the 
stronger the organization of political extremists the greater was the 
success of the NPD 

Hcncc the percentage of refugees m a constituency had a causal 
influence on the effectiveness of local political propaganda instead of 
being an indicator of the tendency ofrefugees as individuals to vote for 
political extremists 
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The Spatial Structuring of Information Flow 
and Partisan Attitudes' 


Awareness or the limitations of cross sccdonil studies m describing 
social processes over time is widespread in the behavioral sciences ^ Jn 
many areas this is reflected m the construction of more dynamic models 
either by replicating social process in simulations or by cross sectional 
analysis containing variables describing change in the observational 
units One aspect of this concern for process m political science is 
interest in information flow This is seen m studies using survey data 
such as those by Converse* and Putnam * and in studies using coarser 
aggregate data such as (he recent attempts of Alker* and of McCrone 
and Cnudde^ to model political modernization 
This emerging concern among political scientists coincides with a 
similar development in human geography In this field much interest is 
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centering on the diffusion of innovations over space, Hagerstrand, for 
example, has examined the diffusion of subsidized pasture an o 
certain religious groups in Sweden and of Rotary Clubs in cs c 
Europe,’ and Morrill Ins examined the expansion of the Negro ghetto 
in American metropolitan areas* In these studies the role 
relations over space and the impact of distance upon the proba 1 1 y 
of social relations has been emphasized as crucial to explaining I c 
flow of information preceding adoption of innovations further, t ere 
IS some good evidence for such spatial diffusion movements ® 
literature of political sociology Rokkan and Valcn’ have described l e 
narrowing of Norwegian urban rural political participation differences 
over time, and Myron Weiner has examined the spatial extension o 
elite recruitment into rural areas in Bengal 
This developing body of empirical studies and related theory suggests 
that a spatial approach toward information flow and political behavior 
or attitudes might help resolve some currently vexing problems In 
particular, certain contradictions already appearing m the literature oi 
contextual analysis may be resolved by examining the area! scale at 
which the networks connecting individuals to contexts operate 
This paper has two aims first, to identify certain contentious issues 
in recent contextual analyses of political behavior and to suggest some 
alternative interpretations and solutions, second, five hypotheses 
summarizing these new interpretations and solutions will be presented 
and tested using data collected in the Columbus metropolitan area of 
Ohio 


CONTEXTUAL ANALYSIS 

Contextual analysis attempts to explain an individual behavior 
pattern m terms of the social context or milieu in which the individual 
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munication and the Diffusion of Information Papers and Proceedings Regional 
Science Associaiian 16 (.1966) pp 27-4'» 

Al.cn.a..ves C«- 

Tu’m™!' M Vakn The Mobilizalion or the Penphery Data 00 

Is 6 0962) l" Nonv-ay Ac, a Socc- 

"S 093S'Tp 



IVrORRfATlON FLOW AND PARTISAV ATTITUDES 


159 


1j\cs When certain of hts own social or other personal attributes are 
held constant Although a sizable literature concerned with the 
identification of such effects «s ducloping technical concerns will not 
be the focus of interest here rather, w-e shall be concerned with the 
development of the contextual model as providing evidence for a 
political information transmission and influence process 
Coleman has identified the contextual model as a social relational 
model basing its conclusions on assumptions of personal influence 
More specifically a recent paper by Putnam exemplifies the information 
flow interpretation of the contextual model and illustrates the type of 
conclusion possible from such contextual analysis ** Putnam is most 
interested m the effects of political context — measured by the per cent 
Democratic at the county level — upon political behavior He finds 
first that community influence is mediated largely through friendship 
groups and that members of secondary associations are more susceptible 
to community influence than are nonmembers He goes on to demon 
strate that friends in secondary organizations arc more representative of 
community opinion and that people who belong to such organizations 
are not only more likely to have friends lypica) of the community as a 
whole but also more sensitive to the opinions of their friends }lis paper 
has gone a long way toward establishing the promise of this area for 
further investigation 

The comcxlual model, as developed thus far, contains certain weak 
nesses First in its present form it is not able to discriminate between 
senders and receivers We ate unable to offer any improvement in this 
direction Second, implicit in much of the work using the contextual 
model IS the assumption that the individual who is being affected by the 
context via an influence process confines his friends and associational 
contacts to the ecological unit chosen for analysis Yet intuitively one 
knows that is not necessarily so, not only will the number of external 


‘'For example see Warren E Miller One Rwty Potties and the Voter 
American Pa/ifKvl SmfK-f JO(1956),PP 707 725andPhilpH Ennis The 
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contacts increase with decreasing areal size of ecological unit but thos 
has mg contacts outside the ecological unit are less likely to be 
by information biased by the context within the ecological unit Thira, 
the model has not yet incorporated a measure of political susceptibility 
to information flow Interpretations of contextual effect assign a con 
version or support role to information, but the resistance of difiercn 
individuals to the partisan cues inherent in much information has ye 
to be evaluated Fourth, and most importantly, the contextual mo e 
does not examine receivers before and after receipt of information m 
order to determine the behavioral impact of that information flow 
individuals are influenced to change their partisan attitudes or to 
strengthen them, what evidence do avc have that this takes place as a 
function of the political context*^ Each of these points will be considerc 
in turn and suggestions made for incorporating more realistic assump 
tions into the contextual model 

If we examine the second problem from the spatial viewpoint, it seems 
clear that it can partially be solved by examining formal and informal 
networks at areal levels or scales other than that of the locality Like 
most students of contextual effects, Putnam works at one areal level — ' 
in his case, the county level He assumes that all the individual s formal 
links take place within the county unit while informal links take place at 
lower levels and arc likely to provide the individual with information 
unrepresentative of the county as a whole Yet m reality the friend 
ships of people and their more formal links take place at a variety of 
areal levels some confine their friendships to the immediate neighbor- 
hood, while others have friends at some distance from their home 
Furthermore, some voluntary organizations give their members scope 
to come into contact with other members of very diverse areal origins 
while others confine ihcir memberships to the immediate locality 
Recognition of these facts demands that we attempt to specify the 
spatial characteristics of informal and formal links more carefully and 
that theory be formulated regarding their efTects on partisan attitudes 
Formal and informal groups can be arrayed on at least three spatial 
dimensions (1) the areal extent of the hinterland of one branch or 
meeting place m the system of forma! or informal links, (2) the degree of 
centralization withm an organization or the degree to which branches or 
individual nodes look to some central policy making or administrative 
® y of either a formal or informal nature — for information advice, 

orders, or propaganda, (3) the connectivity between those branches or 


Ibid 



JN'FORMATJOM FLOW AND PARTl'UN ATTITUDES 


161 


meeting places (by "connectiviij" is meant the degree to which all nodes 
in a nctnork are connected one with another) Thus, labor unions are 
characterized by branches with areally hrge hinterlands and with strong 
connections with a central co-ordinating agency publishing union news 
and views Parent-teacher associations, on the other hand, have areally 
very restricted hinterlands and low levels of connectivity between 
branches The three dimensions are not necessarily correlated; Jewish 
and Roman Catholic churches, for example hie Protestant churches, 
have areally limited hinterlands but. unlike Protestant churches, they 
also have relatively strong central co-ordmalion and circulation of 
propaganda Likewise though the areal extents of the information 
catchment areas of Protestant and Catholic families are likely to be 
very much (he same — other things being equal — minority group 
Roman Catholics arc likclj to be nuivh more interconnected via mar- 
riage ties and attendance at the same church 
The recognition of a senes of information fields or personal informa- 
tion catchment areas v.iry(ng along these dimensions is implicit in the 
work of Putnam** and of other workers such as Segal and Meyer,*’ but 
Ihcir failure to apply it has led m at least one instance to a curious 
contradiction between their findings Scg.il and Meyer both examine the 
cfTccls ofeontexts at the city and ward level and both draw attention to 
the cfTccis of dilTcrcnt religions in integrating the individual into net- 
works of greater or lesser connectivity over the nation as a whole They 
find, for example that the more cohesive religious groupings of Roman 
Catholics and Jews are considerably less affected in their political 
behavior by the social milieu of the ward than are Protestants ‘® 
Putnam, likewise, recognizes that “obviously ‘community environment’ 
could be defined at any number of levels ranging from the slate down to 
the precinct," though exigencies of data force him to work at the county 
level Again, like Segal and hfeyer, he finds (hat members of minority 
religious groups are much less affected by (he political context than are 
Protestants, ascribing this to the reduced social interaction of the 
minority religious groups with the remainder of the community '’The 
contradiction in tiicir woni arrses m ffteir e.tC-c/a of 

voluntary group membership At the ward level, Segal and Meyer find 
that those belonging to more voluntary organizations are less affected 


•’ Ddvid R Sega) and Marshall W Meyer. The Social Context of Pohtrcal 
Partisanship,'’ m this book. Chapter 9 
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by the milieu than arc those who belong to fewer organization 
the county level, Putnam finds that those belonging to more 
groups are more affected by the milieu Although Segal and Mey" " 
dealing with a milieu defined in socioeconomic terms while Putnam 
examining the effects of political context, quite possibly the ' 

tion IS due to the different levels of analysis that tiic two ^ 

People belonging to more organizations seem hkcly to have t 
additional affiliations outside the voting ward but largely within tn 


I - of 

This paper is concerned primarily with the variation in area siz 
information fields defined in terms of formal or informal affiliations 
Thus, contextual effects on those with local friends or those "d 
nonlocal friends will be examined separately Wc also consider l ^ 
impact of the context on those belonging to voluntary groups, the cc s 
of which have memberships that arc areally intensive compared wi 
those belonging to voluntary groups the cells of which have area y 
extensive memberships 

One criticism that we made earlier against the contextual model was 
Its failure to take into account individual variations in susceptibility to 
flows of information In fact, resistance docs seem to vary greatly 
according to demographic or psychological characteristics Rokkan and 
Valcn, for example, have documented the resistance of rural women m 
Norway to stimuli encouraging them to more active political participa- 
tion Likewise, other data from the United Kingdom have shown the 
greater resistance of older age groups and women to the adoption of 
Labour Parly preferences when social class is held constant While 
it is true that political sociology has succeeded in discriminating con- 
ceptually between access to information and susceptibility, it has not 
yet succeeded operationally in isolating susceptibility from more subtle 
psychological aspects of accessibility such as interest m politics The 
principal contribution in this area has been that of Philip E Converse,** 
and, m heu of a more appropriate model, certain hypotheses that wc 
shall test arc based on his work 

Converse argues that the probability of any given voter being suffi- 
ciently deflected m his partisan momentum to cross party lines m ^ 


Segal and Meyer, op cll pp 222-224 
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specified electron \anes direcH> as a function of the strength of short- 
term forces toward the opposing party and vanes inversely as a function 
of the miss of stored mfomntion about politics Moreover, the short- 
term forces that either reinforce partisanship or deflect the voter from 
Ills nornnl partisan course dqtcnd on the flow of current information 
The strength of current mformation flow will affect the magnitude of 
deflection from different parties at an election 

If the flow of information is strong no parJicuJar prediction is safe m the 
sense that the partisan valences of (he incoming information may be so 
varied lhatnci pressures toward changeare weak or in the sense that other 
cancelling processes may arise between individuals whose current reactions 
polarize If the flow of information is weak however, very potent Lmits on 
defeciions and vote oscillation are esiablisbed 

In the light of the postulated biasing effect of a context on the partisan 
character of information however, we would argue that where m 
formation flow is strong, defection will be influenced by any bias 
mhcfcnt m that information 

The informition intake of the voter is limited by two factors (1) the 
output of the system of communicaiion in society (2) the interest of 
the individual or his motivation to attend to information once some 
flow exists Further Converse claims that micrest shows a correlation 
with stored information, but its variation across individuals is much 
greater than that of current information intakes Converse suggests that 
a minimum interest level is required before information flows affect the 
individual s partisan attitudes At the lowest levels there is insufficient 
interest to attend to current information flow, at higher levels the 
individual is “protected from the impact of current information flows 
by his relatively large mass of stored mformation Converse postulates 
and demonstrates a parabolic relationship between partisan instability 
and access to information with the least accessible and the most 
accessible being the most resistant to current information flows In 
this paper a measure ofaccess to political mformation will be employed 
m order to identify contextual eflects upon individuals of varying 
susceptibility to the biased rnforntation flows of the locality 

A final critical omission of the contextual model has been Us failure, 
thus far, to examine the behavior of individuals vis & vts iheir local 
ambicnces over time If there is aflow of mformation which leads to the 
partisan conversion or reinforcement of individuals, then such attitud 
mal change ought to be revealed m panel studies over an appropriate 
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length of t,me In current contextual annlyscx .1 is 
discnnunate between (u) a case of information now leading to parman 
conversion or reinforcement and (6) a case of homopo ilical s'''''' ' 

“Homopolitical selectivity" is a term suggested by 
and refers to the tendency for like individuals consciously to seek 
other out for formal and informal interaction Himmclstrand dciincs 
homopolilical selectivity for both senders and receivers of po m 
information as the tendency of a sender (receiver) to orient 
mainly to receivers (senders) who arc known to have the same po i i 
stand as himself However, a political selection process might also 
result from the application of a social criterion to choice of friends, t c 
findings of political sociology suggest that a selection on the basis o 


social affinity would also be politically selective 
With cross sectional data, an alternative approach, operating wit in 
the assumptions of the contextual model regarding the biasing o an 
individual’s information field by the political milieu of a district, is to 
examine individuals who have lived m that neighborhood and have 
been exposed to that milieu for varying lengths of time The changing 
effects of a milieu with length of residence in that milieu arc open to 
speculation There arc two possibilities* (1) that the increasing time 
spent m particular political contexts leads to increasing intcgralion into 
informal groups and formal organizations m that area and hence to an 
increasing contextual effect via a flow of information with time spent m 
an area, (2) that an individual moving into a new locality searches that 
locality for individuals and organizations of similar ideology and view- 


point to himself This search procedure is a tnaUand-error process, 
and early results arc likely to be much more biased by the nature of the 
political context than are later outcomes Indeed, probability theory 
demonstrates that under perfectly random conditions where an individ 
ual IS exposed to a number of alternatives the expected wait or number 
of unsuccessful exposures before a given alternative is presented to an 
individual equals the probability of not being exposed to that alternative 


on any one trial divided by the probability that exposure will occur on 
any one trial Thus, in this particular case, let equal the probability 
of being exposed to a Democrat as determined empirically by the p^f 
cent Democratic in a precinct, and let Qrf equal the probability of being 
exposed to a Republican as determined empirically by the per cent 
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Republican in a precinct then in a completely two party situation m 
winch all individuals identify with either of the two parties 


that IS the expected number of unsuccessful exposures required before 
the individual contacts a Democrat So m i precinct that is 90 per cent 
Republican assuming randomness of contact and confinement of 
contacts to the preewet an individual would have to undergo on the 
average nine exposures before contacting a Democrat In a precinct 
that is 50 per cent Republican however an individual would have to 
have only one exposure before contacting a Democrat Nevertheless 
little IS known about space searching procedures in geography or in 
other social sciences and the area is entered with an almost complete 
innocence regarding possible relationships 


SOAfE HYPOTHESES 

The critical discussion of the contextual model just ouil ned has many 
implications only a few ofwbich cm be considered here The hypotheses 
selected for initial investigation are however central to further inquiry 
along the lines previously suggested on theoretical grounds 

H I Those belonging to informal social networks at the local 
level will be much more influenced m iheir partisan attitudes by the 
political milieu at the local level than will be those belonging to 
informal social networks at the extralocal level 
H 2 niosc belonging to formal social networks at the local level 
will be much more influenced m their partisan altitudes by the 
political milieu at the local level than will be those belonging to 
formal social networks at the exiralocal level 
H 3 The effects of the local political milieu will be increased for 
those initially establishing new links m a locality over and above the 
contextual effects present for those who have long since established 
such links 

H 4 The effects of the political milieu will be greatest neither for 
the least involved nor for the most involved but for an intermediate 


“See forexampJe ibcrcvcwofthsl iwslure nPcierR Gould Space^rch 
mg Procedures m Geography and Che fences Work rg Pa^n No J 
Soc at Sc ence Research Insi lute (Honolulu UnivcrsUy of Hawait 1966) 
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group that has rclatisdy low levels of current pohlical 

but sumcicnt interest to be susceptible to the biased political stiraut 

of the local environment 


DATA AND METHOD 

The dependent variable employed throughout this study is party 
identification This is an ordinal variable consisting of seven categoric 
with three gradations of identification allocated to Democrats an 
Republicans, respectively, and an intermediate category to Indcfwn 
enls The local political context is measured at the precinct level or a 
random sample of 604 respondents in the Columbus metropo itan 
arca^^ We have differentiated six categories of political context 
0-25 per cent, 26-37 per cent, 38-49 per cent. 50-61 per cent, 62-73 per 
cent, 74-100 per cent Other variables employed in this analysis are 
discussed where necessary as testing of the individual hypotheses 
proceeds 

The method of testing for contextual effects and for comparing the 
varying contextual effects to which difTcrcnt subgroups are subject is 
that recently employed by Putnam The independent variable pef 
cent Democratic m the precinct is related to the dependent variable 
party identification using Kendall s As Putnam has described it, 
“this statistic can be interpreted as -a measure of the gradient m the 
percentage of the Democratic votes among respondents as v/e move 
from strongly Republican to strongly Democratic counties Thus, 
when the social group is held constant, the magnitude of the coefficient 
can be interpreted as a measure of the sensitivity of a group to the 
political milieu, and the comparative sensitivities of different subgroups 
can be evaluated One of the problems associated with Kendall’s is 
that, although it is possible to test for the significance of any one 
coefficient, it is impossible to lest for the significance of a difference 
between two such coefficients Indeed, m many cases, where cross- 
tabulations involve four or five variables, sample sizes are reduced to 
dangerously low magnitudes In almost all cases, however, the pattern 

*’ The best available sampling frame was the Ilalnei Columbus City Directory 
which proved a reasonably reliable listing of the city population Students from 
classes in Public Opinion Metropolitan Politics and Political Behavior were used as 
interviewers All classes were given several days instruction m interviewing technique 
extensive telephone checking of intmiews revealed no cases of fabrication Some 
wiat more than 80 per cent of the interviews attempted were completed 
Putnam op ett 
Ibld.p 642 
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of findings is highly consistent tai6 the differences between coefficients 
clear cut Furthermore, the percentages appearing m the bodies of the 
^companying tables refer to the proportions identifying with the 
Democratic Party but ignoring gradations of intensity within those 
identifiers, needless to say, the coefficients have been computed on 
the basis of all seven party identification categories 

TESTING OF THE HYPOTHESES 

H 1 Those belonging to informal social networks at the local level 
will be much more influenced in thcir partisan attitudes by the politi- 
cal milieu at that local level than will be those belonging to informal 
social networks at the exlralocal level 

As Table 1 and the accompanying values of r, indicate, the partisan 
attitudes of those with local friends do seem to vary more as a function 
of the political context than do the partisan attitudes of those with 
nonlocal friends The difference m the values of t„ however, is a small 
one If this difference is a reflection of a converting or supporting effect 
of biased political information within a context, then the difference 
would be intensified when we turn our attention to those who discuss 
polities — that is, those most accessible to that information, be it 
biased or unbiased by the local political context 
Presumably, the difference m the contextual effect upon those with 
local and nonlocal friends, respectively, would be greatest for those 
who discuss politics One can imagine four kinds of respondents in our 
sample. 

(1) those who discuss politics with local friends, (2) those who don’t 
discuss politics with local friends, (3) those who don’t discuss politics 
with nonlocal friends, (4) those who discuss politics with nonlocal 
friends Hypoiheiically, if there is a transmission of information biased 
by the political context, one would expect the contextual effect to be 
most intense for those who talk politics with local friends and to be 
least for those talking politics with nonfocaf rnen*. wfttfe Chase who 
don’t talk politics should occujy intcrraedtaie positions As can be 
seen from Table 2, this is not the pattern that emerges m the case of 
informal contacts Indeed, instead of finding the greatest contextual 
effect for those who discuss politics with local friends and the weakest 
contextual effect for those who discuss pohiics with nonlocal fnends, 
the greatest difference m the coefficients occurs for the two groups 
who do not discuss politics at all 
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The burden of (he evidence presented here suggests that the areal 
scale of informal groups docs not have a differentjal converting or 
supporting effect on those telongmg to local and nonlocal informal 
groups respectively via a biased flow of information Not only is the 
difference m the contextual effect upon local and nonlocal groups very 
small, but holding discussion of politics constant does not lead to the 
anticipated intensifier effect 

H 2 Those belonging to formal social networks at the local level 
will be much more influenced in their partisan attitudes by the 
political milieu at the local level than will be those belonging to 
formal social networks at the exiralocal level 

The testing of this hypothesis poses two operationalization problems 
First, there is the problem of identifying a defensible allocation of 
voluntary groups to a scale of areal extent in branch catchment areas 
Second, any one individual may belong to organizations differing 
sharply in areal extent Here, m lieu of detailed knowledgw about the 
organizational characteristics of interest, voluntary organizations are 
classifled into these two categories 

1 Areally Intensive parent teacher associations church groups, 
local service groups 

2 Areally Extensive labor unions, fralemaJ organizations, veterans’ 
associations, political clubs, sports groups 

Then, if an individual belongs to any of the areally extensive groups, he 
IS allocated to the areally extensive category This does not preclude the 
possibility that the same individual might also belong to an areally 
intensive group, but this would be of no significance in categorizing 
an individual who belongs to an areally extensive group also If, 
however, an individual belongs only to areally intensive groups, he is 
allocated to that category 

As Table 3 shows those belonging to areally intensive organizations 
anr ciVairaciVnnoaf true of those 

with areally extensive affiliations Moreover, the difference in these 
contextual effects is much greater than was true of the differences m (he 
T, coefficients for those with local and nonlocal friends, respectively 
However, if we hypothesize a contextual effect fay which information is 
passed between individuals leading to conversion or support, we must 
first establish the relationships between context and partisan attitude for 
those who don t talk politics and for those who do, controlling for 
areal mtensivcness or extensiveness 
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If formal groups do exert a biasing cITect on the political conten 
one’s information intake in such a manner as to lead to conversion 
support of one’s partisan attitudes, then the contextual cfTect shou 
greatest for those -who discuss politics and belong to areally intcnsi 
organizations This can be tested for Uvo cases (1) discussion of politics 
with group members, (2) discussion of politics with friends The r«uii 
for these two analyses are presented m Tables 4 and 5, respective y 
The hypothesis is clearly upheld in this case There is a remarxa 
congruence between these two tabulations in the magnitudes o t 
analogous coefficients and in the rankings of the coefficients 
sivc-discuss politics, intensive-don’t discuss politics, extensive on 
discuss politics, extensivc-discuss politics Therefore, thus far, there is 
far greater evidence to support the idea of transmission of information 
in the case of formal organizations than m the case of informal organiza 
tions 

Indeed, it is possible to rank formal and informal organizations in 
terms of the contextual effects upon partisan attitudes as evidenced y 
the Tc coefficients (see Table 6) Table 6 suggests the existence of a 

Table 6 


Te Coefficients for Formal and Informal Contacts 
Inside and Outside the Locality 



Kendall s Te 

Areally intensive organizations 

342 

Friends in the local area 

265 

Friends outside the local area 

231 

Areally extensive organization 

178 


continuum with areally intensive organizations showing the greatest 
contextual effect, areally extensive organizations showing the lea^t 
contextual effect, and local and nonlocal friendships falling into inter 
mediate positions This continuum agrees intuitively with one definable 
in terms of the degree to which affiliations in the different groups are 
representative of the political context Putnam has suggested at the 
county level that informal friendship groups are likely to be far less 
representative of the political context in terms of their political com 
plcxion than formal groups,^* this appears to be true at the precinct level 
also Moreover, outside the preanct, informal groups appear more 
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representative of tbetr precinct of residence than do formn/ groups This 
js impressive because tt stresses the degree to which the contacts induced 
by forma! groups arc imoluntiry and therefore likely to bring the 
respondent into contact with a different range of political opinions than 
be might choose of Ins own accord 

H 3 That the cfTtcts of (he local political miheu will be increased 

for those initiallj establishing new links in a locality over and above 

the contextual effects present for those who have long since established 

such links 

It will be recalled that m our discussion of the contextual model and 
problems in its application thus far it was suggested that length of 
rcsidcncccould have two possible impacts (0 with increasing length of 
residence m an atca the mdividuat becomes tnctcasmgly integrated into 
the formal and informal networks of the locality, resulting in increased 
exposure to, and increased probability of conversion or support by, 
political information biased by the locality (2) with increasing length 
of residence in an area the individual broadens the political range of 
his informal and format contacts, a range which is likely to be highly 
restneted m the early stages of space searching in a biased political 
environment This second possibility resembles Hypothesis 3 In the 
tabulations that follow these ideas wr/f be tested byusingadichotomous 
definition of length of residence two years or less m one's current 
residence or more than two years 

Tables 7 and 8 suggest that for the local contacts of either a forma! 
or informal character recent residents arc more influenced by the 
context than residents of long standing in support of the second 
possibility, while for nonlocal contacts the contextual effect seems to 
increase with length of time resided in a precinct in support of the 
original hypothesis and the first possibility This is perfectly feasible if 
one interprets those with nonlocal contacts as being in a situation that 
protects thvm initially from the biased space-searching procedures to 
which those with only local contacts are subjected that is, the process 
of making initial contacts is much slower than for those who confine 
their contacts to the locality 

Moreover, if the tv coefficjcnts are rearranged as in Table 9, further 
relationships become evident First, it is clear that the ranking of 
formal and informal organizations at different areal levels in terms of 
contextual effects upon partisan attitudes »s apparent for both recent 
and nonrecent residents Second, there appears to be a homogenization 
or convergence effect for (hose contextual effects over time, there is a 
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Table 9 

Tc Coefficients for Formal and Informal Contacts, Inside and 
Outside the Locality for Recent and Nonrecent Residents 


Areally intensive organization 
Friends m the local area 
Friends outside the local area 
Areally extensive organization 


decreasing variance in the values of the coefficients as one moves from 
recent to nonrecent residents, although the mean vanes hardly ^ 
Third, the differential m contextual effect between those with local an 
nonlocal contacts, respectively, is obliterated for those with informa 
links with increasing length of residence but remains intact — thoug 
reduced — for those with formal links at different areal scales 
If these relationships for those with local contacts and nonlocal 
contacts respectively, are indicators of a political influence process, 
then they ought to intensified when the degree to which individuals 
discuss politics is taken into account Thus, the partisan attitudes of an 
individual who has just moved into an area and who makes local 
contacts and discusses politics ought to be much more afTected by the 
context than someone who docs not discuss politics Likewise, the 
partisan attitudes of a respondent who is a recent resident with non 
local contacts ought to be much less affected by the political context 
if he discusses politics than if he doesn’t Further, the rank ordering 
of the Tc coefficients for the four groups defined in terms of locale of 
contacts and discussion of politics ought to be preserved over time 
In order of descending magnitude this order should be local-discuss 
politics, local-do not discuss politics, nonlocal-do not discuss politics 
nonlocal-discuss politics 

Table 10 presents the cross tabulation for informal contacts, and it 
can be seen that these expectations are not entirely borne out If ^ 
political influence process was at work concurrent with the space 
searching procedures suggested earlier, there should be a decrease m 
the value of the coefficients with increased length of residence for 
those who discuss politics with local friends In fact, there is an increase 
Ukcwisc, for residents of long standing the highest coefficient should 
be for those with local contacts who discuss politics, and the lowest 
should ^ for those who discuss politics with nonlocal friends The 
table indicates that the latter coefficient is, in fact, greater than the 


Recent Nonrecent 
377 311 

303 233 

225 231 

101 225 
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former For informal relations, therefore, the coefficients do not 
reveal the pattern required to sustain the hypothesis of changing 
political influence patterns as a function of space-searching 

Tables 11 and 12 present the results for formal contacts Table it 
treats those organization members who discuss politics with t eir 
fnends, and Table 12 examines those organization members who discuss 
politics with other group members In these tabulations the hypot 
csized rank ordering is validated much more clearly than was true 
for informal links While the pattern is blurred slightly in the case ol 
discussing politics with friends, it is very clear in the case of discussing 
politics with organization members Those recent residents with oca 
contacts who discuss politics arc much more afTected by the biasing 
effects of the political milieu as they search space for secondary hn 
than are those who do not discuss politics Likewise, those recent resi 
dents who discuss politics with nonlocal contacts are very muc 
sheltered from influence by the local political milieu Furthermore, the 
convergence in the values of the coefficients over time is much 
for those who discuss politics than for those who don’t, thus in Table 
12, while the values of the r, coefficients for those who discuss politics 
decrease from 413 to 160 over time (a difference of 253 ), they decrease 
from 163 to 038 (a difference of 125) for those who don’t discuss 
politics Thus, not only are the apolitical who belong to areally intensive 
groups less alTectcd by the political milieu, but they are also less affected 
by the biasing effects of the milieu on space searching It is similar for 
the apolitical among ihose in areally extensive groupings — they arc 
more affected by the political milieu but less affected by the need for 
search for local contacts 


In summary, it appears that for those belonging to areally intensive 
organizations, the original space searching hypothesis is validated* recent 
residents arc more biased by the milieu For those belonging to areally 
extensive organizations, on the other hand, it appears that contextual 
influence starts at a low level and increases over time as a result of ^ 
retarded local space searching procedure in which the individual is 
sheltered from the impact of the biased information encountered m the 
early stages of the space search For informal groups, on the other hand, 
the pattern of ihc t, coefficients, as with the testing of other hypotheses, 
IS far too distorted to provide evidence for the postulated process 


H 4 The effects of the political milieu will be greatest neither for 
the least involved nor for the most involved but for an intermediate 
group that has relatively low levels of current political information 
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but sufficient interest to be susceptible to the biased political stimuli 

of the local environment 

Tlius far we have suggested that those belonging to areally extensive 
organizations are far less subject to the political influence of the local 
political milieu than are those belonging to areally intensive organiza- 
tions Further, we have found that these relationships change over 
time in relation to diflercnt stages in the space searching process The 
hypotheses regarding informal contacts, however, have not been sus- 
tained If the political milieu does exercise a converting or supporting 
effect upon the partisan attitudes of those belonging to voluntary 
organizations at different areal scales who are the most resistant, and 
who are the most susceptible'’ 

As set forth earlier, Converse has su^ested that it is neither those 
who are most informed about politics nor those who are least informed 
who demonstrate the greatest susceptibility to partisan change under the 
impact of current information flows but an intermediate group — a 
group that has sufficient interest to attend to current flows of informa 
tJon but insufficient stored information with which to defend itself 
against the persuasiveness of such contemporary material 
In this paper an attempt has been made to operationalize Converse’s 
current information continuum m terms of a Guttman scaling of 
certain items thought to be indicative of political involvement,’* in his 
original statement. Converse did use attention to actual media as his 
index of current information intake, but such data are not available 
for our Columbus sample Certainly, however, as shown m Table 13, 
the same pattern of greatest partisan instability for the mtcrmediale 
group docs seem to reveal itself In (his case, the measure of partisan 

instability isprovided by r, coefficients between the respondent s partisan 

attitude and his party preference in I960 and (he party identification of 
his father, respectively 

As Table 14 indicates, it is indeed in (he groups of medium political 
involvement that the conlcxtual effect is greatest when the areal level 


»* These items vfers ihe responses lolhc roltowmg questions, in Off" 
hardness (IJ ‘•When you talk wiih your friends do you ever lalk pub^ 
problems that is whaishappenmsmlhis^niTyoMnt^cny? W 

often do you geunto a conversairon about perfitics’* W 

know aro you more or Jess J.kcty than any of them » 

poliiics? (4) "Have you tried to consuice any^ f your 

week or so-* This produced a five-po.nt scale ™ 

however that rcJatively few respondRiu wenraRocaied to ihc first three cawgones. 

these were thereforo combined 
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Table 13 

T. Coefficients for Party iDEimncATTON and Party Prefer^ce 
I960, AND Father’s Party Identotcation, Controllino for 
Political Involvement 


Low Medium 

Imohemenl ImohemenI Imobement 


Party preference, 1960 500 485 

(N=n8) (AT = 193) 


Father s party identification 355 

(iV= 124) 


259 

(N = 2M) 


319 

(N - 79) 


of voluntary groups is held constant, indeed, for areally intensive groujw 
the value of the coefficients is particularly high, accounting for muc 
of the difference in the t/s for areally intensiNc and areally extensile 
groups, respectively 

From Table 15, however, it is again clear that this pattern is no 
repeated for more informal groupings While for those with nonl^ 
links the expected pattern is confirmed, such is not the case for those 
with local contacts 


CONCLUDING COMMENTS 

This paper has four noteworthy empirical conclusions First, it seems 
that formal contacts much more than informal contacts act as the 
channels along which political influence flows While predicted differ- 
ences in contextual effect have appeared for both informal and formal 
groups at different areal levels, these differences are intensified m 
magnitude and direction only when discussion of politics is held con 
slant for formal groups The respective cross tabulations in which 
length of residence and susceptibility are held constant also strengthen 
these arguments This is an important conclusion since previous work, 
particularly that of Putnam, assigned almost equivalent and compl® 
mentary roles to formal and informal organizations as channels for 
information flow Clearly, this contradiction needs to be taken up m 
further empirical work 

Second, this flow of political influence has a spatial component, so 
that for any precinct of residence those individuals who belong tn 


Putnam, op at pp 650-651 



Table ]4 

AL Effects, Orgamzatio'i McMBEuittPs, and Political Isnolvemcnt 


INFORMATJOV FLOW AND PARITSAN ATTITtniES 


183 




Political Involvement 


184 


KEVIS R <X)X 



McUuim m\ol\cmcnt 



INFOR^tATtOV now AND PARTISAN ATTITUDES 


185 


arcaHy more exicnsj\e organizations arc Jjkcly to be exposed to political 
opinion difTercnt from that toiAhich members of intensive organizations 
arc exposed It js probably for this reason that Putnam and Segal and 
Mc>cr obtained conllicting results m their respccine examinations of 
contextual cflerls on groups \aryiag m their number of secondary 
affiliations Certainly, the more conticts an individual has, the more 
likely some of them will be outside the immediate locality 
Tliird, there appears to be a space searching stage m the individual’s 
socialization inio a community This is a stage m which the individual 
IS trying to find agreeable contacts of a formal or informal nature m the 
local area and during which he is more likely to be exposed to radically 
different opinions than later m this trial and-error process All we have 
provided m this paper, however, arc some indications that space 
searching is involved in the socialization of an individual into a new 
community; applications of learning theory here might provide more 
definitive conclusions in the future 
Finally, we have attempted to incorporate Philip Converse’s measure 
of partisan instability into our contextual analysis While the surrogate 
scale for information intake employed in our testing of Converse's 
hypothesis is not the most desirable, the parabolic relationshtpsidentiffed 
between partisan instability and current information intake arc clearly 
reflected m (he relationship between contextual effect and political 
involvement scale for formal groups at different areal levels 
An attempt has been made in this paper to provide more definitive 
conclusions on the basis of a contextual model than have been provided 
hitherto By examining individuals characterized by varying lengths of 
residence, we have tried to minimize the dangers of inferring fallacious 
influence flows where a more realistic interpretation would be in terms 
of homopolitical selectivity Retrospectively, controlling for discussion 
of politics has performed a similar function Unfortunately, there is 
very little literature on the problem of contextual effects as a reflection 
of an influence flow process and/or as an indication of a selection 
process in the choice of friends Yet the problem of evaluating influence 
flow as opposed to homopolitical selectivity is not likely to be a diflicuU 
problem to resolve and should occupy a high priority in the research 
of those concerned with explaining variations in political behavior 
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Ethnicity, Religion, and Politics, in Canada; 
A Comparative Analysis of Survey and 
Census Data 


In its more common usage “ecological” describes the study of mter- 
actions betsveen individuals and Iheir environment In a more technical 
sense u also describes the use of aggregaie staiisucs obtained for com- 
parable territorial units tn order to determine individual behavior It is, 
then, a sort of hide-and-seek game one plays with obscure data, usually 
for lack of better information, sometimes for the pleasure one gets from 
drawing maps and from relating people and terntoo' 

Taken in its narrow technical sense, the ecological method is only one 
application of the use of aggregate to reach individual characteristics 
Tlicse "individual cbaraclcnstics" need not be “individuals”,' they 
may be traits contained wiibin an individual, individuals contained with- 
in a group, or a series of subgroups contained within a larger group — a 
territory, for example There is no necessary opposition bctiveen eco- 
logical and survey research As a territory is the environment of in- 
dividuals, an individual is the environment of trails, and to relate these 
traits through material obtained by interview leads to the same dilB- 
culties as the attempt to reach (he individual through census data 
Whatever diflerence there may be between the use of survey and of 


* The language is irescherous — one sbps easib from iFie notion of 'imiivicJuai ’ 
meaning “indivisible unit of aratjws" lo ifie notion of ' mdindual" meaning 
••people ’’HR Alker does « hi hts exeeHciM summary of ihe arguments against 
ecological analysis Set his ^fotbema^la wMt Potities (tiev/ York MacmiHan, 1965). 
p 62. 
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census data ss.ll come from the defimt.on of our units “f 

end analysis, of our definition of tshat is container and what is conten . 

of where the fish begins and where the fishbowl ends 

To contrast, as we propose to do, findings obtained fro 
poll sure eys with those obtained from the census is not to oppose sur y 
and census per se but simply to compare the advantages and 
of each kind of data for our very specific intent, which is to rel 
religious denomination and the ethnic origin of Canadians to po i 
Since our unit of analysis is the individual, the survey wil giv 
individual statistics, Vihile the census, most of the time, will yie ® 
aggregate statistics 


THE DATA VSED 

The choice of the election of 1963 as the pivot of our anai>sis needs 
to be justified The national surveys available to us spanned the years 
1953-1963, all had been taken on the occasion of a federal election 
Two of them, those of 1958 and of 1962, would haNC been closer to the 
time of collection of the census (I960) The election of 1958 was ru e 
out as a clearly deviant case, it marks the only time in Canadian history 
when a single party obtained over 75 per cent of the seats in Parliamcn 
In that >ear, partly as a result of the electorate’s desire to give a majonty 
to a party that had governed with a minority in the House, the Con- 
servatives progressed suddenly from their 30-35 per cent norm of t e 
popular vote, a norm they had maintained since 1935, to a high of 
per cent of the electorate That this election was a deviant case is showm 
by the elections that followed (1962, 1963, 1965) In 1962 the two major 
parties, Conservatives and Liberals, were at a balance point, each wit 
37 per cent of the votes, in 1963 the Liberals had regained a more 
“normal ’ lead of 40 per cent to 30 percent over the Conservatives, an 
this is the justification for our choice 

The poll survey used is that taken by the Canadian Institute of Public 
Opinion (CIPO), an affiliate of Gallup, before the election of Apnl 
1963 It has the advantage of providing a relatively large number ol 
cases {N = 2,710, of which 2,157 are originals and 553 duplicates), d 
suffers, however, from the disadvantages associated with the arca'<luol^ 
tjpe of sampling and from the further disadvantage of using categories 

* For a summary and further discussion of the ‘ Aggregation problem” m 
socul saences see O D Duncan, R. P Curzon and B Duncan, Sialisttcal Geof 
raphy Froblems in Analyzing Areal DataiGlmctx Free Press. 1961) 
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Of classification often badly adapted to the Canadian scene For example 
religious .ifliliation is given for Protcstani. Catholic, and Jew, a 
traditional United States classification Bui the Jewish classification is 
practically useless, even in a sample of ifui size, because ihere are too 
few Can idian Jews Inversely, the Protestant classification is loo broad 
The Anglicans, at least, should have been singled out, though they are 
not an established church as m Great Britain they have maintained 
much of the social and psychological altitudes that result from a 
pnvihged link with the Crown 

The census matenal we used is not available m the printed publica- 
tions of the Dominion Bureau of Statistics (DBS) but can be obtained 
from them in the form of computer pnntouts, which provide some 
demographic statistics by electoral district The usual demographic 
charactersitics were not all available m this special report, there were 
none, for example, on profession This imposed some restrictions on 
our choice of variables But even if all the usual information had been 
available, the census would still have frustrated us by its usual limita- 
tions The categories it uses to segregate individuals are evidence of the 
concern of economists and of bureaucrats in the hospital and coroners’ 
services rather than of sociologists It is unfortunately as unacceptable 
for a census laker to ask "Whai political party do you prefer’’ ’ or 
“How often do you go to church? ’as to have him say “I shall read you 
Iwojokes.you tell me which one, on the whole, you like best ’’ 

Of the electoral statistics used little need be said They were taken 
from the 1963 Chief Electoral Officer’s report, its only claim to dis 
tmction IS that for the first time m Canada it mentioned the political 
party of the candidates Although, as m the United States or Britain, 
voting IS, on the whole, either for a party or for a national leader, 
Canada continues to adhere officially to the theory that electors vote for 
individuals rather than for the groups that sponsor them But the party 
names that appeared for the first time in 1963 in the election reports 
have still to appear on the ballots 


T/f£ VARIABLES 

We propose to relate ethnicity and rdigion {a) to a social hierarchy 
measured by the level of edueatioit and (i) to the Jen and i.sht eon- 

Imuumasmdicalcdbypartypreference 

We expect, of course, to disprove the null hypothesis that ethno- 
religious groups arc found in equal proportions at ail levels of the 
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educational hierarchy and among all political electorates 
prosing the null hypothesis, vie expect also to be able to 
Native position of the groups we have selected in these educational 
political dimensions 


Dependent Variables 

To measure level of education, we shall use only two 
describing the two extremes of the pyramid* (a) primarj school e u 
tion and (6) university education . 

The politial variables are given by the five parties " . 

Parliaments of the 1960’5 the New Democratic Part> 

Liberals, the Conservatives, the Social Credit Party and the Cre i is 
The NDP is Canada's socialist party, it combines rather than blen ^ 
an agrarian tradition in the West and a Fabian-trade-uniomst 
ment in the major English Canadian cities The Liberals can be s 
compared with the Swiss Radicals, or the French Radicals o 
Third Republic, they are the “governmental ' party, par habitude i n 
par excellence The Conservatives, like the Socialists, have also a w^ern 
agrarian antibank, aniicity stronghold tradition but remain linked as 
well to the city business interests — a link that has weakened somctvha 
under the leadership of John Diefenbaker The two Social Credit parties 
can be best compared to the French Poujadists, they have, however, 
the embryo of an economic doctnne. a poor man’s v ersion of Keynes 


Independent Variables 


The Canadian population can be divided roughly into three major 
ethnic categories, if we define ethnicity by the country of origin of a 
person or of his ancestors on the father’s side There are 44 per cent 
British, 31 per cent French, and 23 per cent “other Europeans,” who 
are sometimes described as “New Canadians” in contrast to the two 


“founding races ” The remaining 2 per cent, which comprises Indians, 
Eskimos, and Asians, will be left outside the scope of this study 

Religiously, Canada is almost equally balanced between Protestants 
and Catholics, so well balanced that the immigration authorities ha'® 
ceased to publish and, theoretically, also to record the religion of ne^ 
immigrants As in Lebanon, official ignorance of changes in a con 
tcntious and precarious situation is thought politically advisable 

If the major Proleslanl denominations are counted separately, the 
Roman Cathobes dominate with over 45 per cent of the population, the 
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Ho numerically ncarwt competitors have only 20 per cent (the Umted 
Church)* and 13 per cent (the Anglicans) 

If we go beyond this gross chssihcafion and rcUie religion to country 
of ongm. we see the cthnic-rchgious mosaic so cfuracienstic of North 
America Table 1 gives the percentage of the tola! population of each 
of the 168 ethnic religious groupings recognized by the census Only 
three groups have more than 10 per cent French Catholics (29 per 
cent). British Anglicans (11 per cent) and British United Church (15 per 
cent) British Catholics, With ^percent arc the only other group having 
more than 5 per cent Thus four ethnoreligious groups account for 63 
per cent of the total population The remaining 37 per cent arc divided 
among a large number of small and very small groups 
Among major religious and ethnic groups reciprocal overlap is the 
rule, with one important exception A few statistics will illustrate the 
rule Only 84 per cent of Anglicans arc British, and only 25 per cent of 
British arc Anglicans, only 84 percent of Presbyterians are British and 
only 8 per cent of British are Presbyterians Only 25 per cent of Luth- 
erans are Scandinavians, and only 38 per cent of Scandinavians are 
Lutherans Over 10 per cent of Poles belong to denominations other 
than Catholic, Jewish, or Lutheran The exception is that of French 
Canadians If only 63 per cent of Catholics are French, 96 per cent of 
French arc Catholics This is a striking measure of the lack of pene* 
tration of the Protestant denominations into French Canadian society 
— an indication of the success of the “besieged fortress” mentality 
adopted by French Canadian society since the conquest If one con- 
siders the size of both the Protcslanl and the French Canadian groups 
and the statistical chances of social contact between the two groups in 
cities such as Montreal or Ottawa, the French Canadian Protestant is as 
odd as the Chinese Jew 

The ClPO poll survey did not allow much choice in the selection of 
ethnic and religious variables We shall use its "Protestant and 
“Catholic” categories and its eihnic-Jinguistic definitions of "English.” 
"French ” “Others (New Canadians),” which are based on the language 
spoken m childhood and suIJ understood The census classifications are 

morcrermed, we shall use only someofthem (see Table 2) 

The two instruments of research at our disposal have different 
qualities The survey has the great advantage of giving us individual 
statistics but has only gross and limited categories The census has a 

» The Uniled Church is the mult of a of Afethodisis. Congrcsahuoahsis. 

and a majoriiy of Presbyicnanj 




• trfor due to rounding - 0 23 per cent 

•• Less than 0 01 per cent 

Source Cenmj ( / Cuno / • IW/ Vol I, Part 3 
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Table 2 

,s=:”==^ 

ClPO Categories 

Census Categories 

B^nrednuEungespoeen 

English 

French 

French 

Other 

Other 

Erteefcounttyoforwn) 

British Isles 

Religious 

Protestant 

France 

Germany 

Scandinavia 

Ukraine 

HeUgious 

Anglican 

United Church 

Catholic 

Lutheran 

Jew 

Bapiisi 

Caihohc 


riiuiiTui^'y sw'p'i' 

refined 

survey appM” far ns should the doplma- 

census, int.wg lh= * ' L ecoloEicnl “"^^jness - use u •» 

'’“'-“''‘dr 

gorienl Unuuuons, 

ETnNIClTY. and 

Combinmglhesnm^^ Protestnn' ^ j„„.. Calhohc. For 

S;bo.,^rS?nund,nn--^'; 
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each of these categories let us determine the proportion of 
with at least one year of university training, a proportion that “ 
be read as the observer s chances of being right in saying that an ng 
Protestant or a new immigrant Catholic has university training t 

The trees of university training and low educational achievement 
presented by Figures 1 and 2 indicate first the differences between urban 
and rural areas then for each side of the tree the differences between 
linguistic groups, and finally differences between linguistic re igious 
groups The percentage in the upper left corner of Figure 1 m ica es 
that 10 per cent of adult English Protestants in nonmetropolitan areas 

have entered a university The sample percentages for French Cana lans 

have been multiplied by 1 75 to account for the fact that the first t rec 



’the French Canad an sample percentage has been multiplied by 1 75 to accouf** 
for the dilTerencc between En^ish and French universities 


Figure I 

Tree of University Education (Percentages) 
This figure IS based on aPO survey No 302 1963 
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fismt ^ „ ScnoouNO) 

TR.crrooAEacCADos(Us3T..ANC,o, 

r „.h Canato" aelmmon arc obtained m 
jcars “"'''"''^,£hoo\s”oulsi<ic tbc p„ple) and non- 

French Canada in _ metropoliW" of the tree since it 

The on •>=J“"X'ZT«as pu. « '1'’“ ItcIUnomio 

metropolitan (iDral ^,,hancith ^ f 

avas found to be a belter o ^ h„e E presented 

status (SES) But '‘^^^,,g,„„s proups n""'” “ „er educated than 
differences beWeen e „ ,he difference 

On either side of the metropobta university 

level, but twice as many dds. 

sun-ey material, m 
1963) 
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yond primary school ’ In the small-town and rural 
between the two “founding races” is even greater On the 'vh , 
French Canadians appear closer to the “New Canadians th 
dominant English group That ethnicity rather than religion is the lac 
causing these differences is indicated by the facts that in the metropolitan 
areas English Catholics and English Protestants have the same per- 
centage of university-trained people and that the Catholics o s *6 
belter m the rural areas Among “New Canadians there is a mar 
tendency for Protestants to rank higher than Catholics; but, here ’ 
very likely religion disguises an ethnic dinerence betvseen the immigran 


from Northern and Central Europe 
In an English-speaking country, one would naturally expec i 
dividuals whose mother tongue is not English to find educationa an 
social upgrading more difficult That there is a wide gap between Eng is 
and “new Canadian immigrants” is thus not surprising , more interesting 
IS that the French of Quebec rank so low compared to the relative 


standing of the French in other provinces , 

A companson of the two major Canadian provinces, Quebec an 
Ontario, confirms our previous observations (see Table 3) but mig 
lead one to think that the educational dominance of the English m 
Quebec is compensated by the dominance of the French m Onl^JO 
The 2-1 ratio m favor of the English of Quebec is almost reversed m 
favor of the French in Ontario 

However, while the educated French Canadians of Ontario are con- 
centrated in Ottawa and serve m a federal civil service that is theoreti- 
cally bilingual but is m fact almost entirely English-speaking, the Enghs 
Canadians of Quebec are concentrated in Montreal, they are business 
executives or managers and need not work through the language of the 
host community The French are in Ottawa as civil “servants”, the 
English are in Quebec as business “dominants ” In Toronto, the major 
English Canadian metropolitan center, only 3 per cent of the population 
spoke French as their mother tongue, while in Montreal, the major 
French Canadian metropolis, 24 per cent spoke English ® 

At this point in our analysis we need to measure each linguistic 


’ Compulsory education from the age of six to fourteen m rural areas and from six 
nu introduced in Quebec m 1942 Sec C b 

Phillips, The Development of Education m Canada (Toronto W J Gage. 1957) P 

• Fofstudies on the history of economic development inQucbecsee among others 
iheroninbuiionsbyA Faucher.M Lamontagne. and Falardeau in J C Falardeau 
1953)^^^*^ Coniemporatn (Quebec Les Presses de 1 UnivemtS Laval 
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Table 3 

PfRCCNTAGE OF PEOPLE »N SELECTED ETII'.ORELIGIOUS GroUPS 
WITJI UMVERSITV TRAIMSO JS ^^^•ROPOUTAN 

Ostario and Quebec 



Ontario 

Quebec 

English rroicjiants 

17 

24 

English Catholics 

12 

U** 

French Catholics 

lb 

• New CanaJian Protestants 

S 

• 

"New Canadian * Catholics 

7 

3 


• Too Tew caws <!«s than 25J 
•• After forreeiion (see note to Figure I) 
Source OPO poll No 30i April I9M 
Note In each cell Ihe maeimum is 100% 


group's rcsislancc to assimtWlon bp the other since we haveantici paled 
our rindings by assuming that the French become assimilated (at least 
Irnguistically) in Ontario but that the English do not become assimilated 
in Quebec, or at least not to the same ealeni 
Since the suriey lacks the data on which to base such an analysis, 
we shall use the census, and since the census Tails to correlate mother 
tongue and the ollicial language spoken, we must compare aggregates 
Before doing so let us seek lo duplicate part of our survey observa- 
tions In order to tod out whether ethnic and religious groups are so 
dislributcd that individuals may be described by 

went within the bounds of the electoral districts, we shall '“t'lMe 'h' 

“rlmi otomicrsily-lrained m each district with selected ethnic and 

religious , of Quebec (see Table 4) the simple correla 

Forlhcmclrop variable with university education 

r 1 T thatS French and consequently the Catholic stand 
clearly thay are the only variables with 

apart by their liig ,l,a o,her variables have such high 

negative signs "“h Table 5) that it B impossible to sort out 
positive single ,he Protestant denominations 

the new immigrants irom iw « 

among ‘hemselves Canadians" from the 

One could try, , ,j„l_ somshuig that can be done more 

eaXfoHtnguB^c than for denomioatmnal groups Our purpose is. 
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Table 4 

Ethnicity, Religion, and University Education 
IN Metropolitan Quebec* 


Ethnic (national origin) 

Religion 

69 

Bntish 

66 

Anglican 

French 

- 76 

Baptist 

61 

German 

87 

Lutheran 

85 

Scandinavian 77 

Catholic 

- 82 

Ethnic (mother tongue) 
French — 76 

United Church 

69 


Other than 
French or 
English 47 


• Simple correlation of selected ethnic and religious groups with the P**^^*^® 
of university trained per electoral dtstnci Only correlations over 40 are gi 

hovvevcr, fulfilled by drawing from the correlations both 
firmation and a warning We find it confirmed that English Canadian 
have a distinctly higher educational ranking than French Canadians 
The warning is that the “New Canadians" settle, within the Frenc 
host society, alongside the British, and that when the percentage o 
British increases, the percentage of ‘ New Canadians" tends also o 
increase Thus whenever we compare ‘French" and “British" districts, 
we should endeavor to control for the “New Canadians " 

Measures of Ltnguislic Assimilation 
To measure assimilation or resistance to it, we shall use two different 
ratios (a) mother tongue to ethnic origin and (6) official language to 
mother tongue We want to know, for example, how many of those who 
spoke English as their mother tongue are of British origin and how 
many of those listing it as their official language spoke it in childhood 
Similar data for the French will give a comparative measure of both the 
resistance to assimilation and the success in assimilating others 
The mother tongue-ethnic origin ratio for the French of metropolitan 
Quebec is 1 01 , for the British it is 1 12 If we assume that the English 
linguistic group has kept nearly all its original members or their 
descendants, a reasonable assumption, it has expanded 12 percentage 
points beyond its base, but the French have progressed hardly at 
A few French Canadians of non French origin are well known— 
former Prime Minister of Quebec and leader of the Union Nationals 
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Darnel Johnson, for exampk- but they rema.n sinking 

The French Canadian milieu has remained a very ^ 

The “New Canadians” have assimilated into English , 

than into French culture, even m Quebec For the whole omnada 
the ratios, as one should expect, are even more unfavorable to F 
Canadians , the French ratio is a low 0 92, the English ratio is 1 ii 
The ratio of official language to mother tongue should 

contemporary trends Its use is complicated, however, by t c a 

the census, in addition to the categories “French” and ‘ ’ 

also “bilingual ” We shall assume that all those who listed ^h^se 
as speaking “French only” had French as their mother tongue c * 
introduced by this assumption favors a high ratio of French 
resistance to linguistic assimilation since it implies that no Eng *5 
bom would have gisen French as his sole official language, w w 
probably very close to the truth, but it implies also that no c 
Canadian” immigrant would have listed French to the exclusion o 
English Since we want to show that the French have a low dc^ee o 
resistance to linguistic assimilation, the bias can only make the 
obtained more convincing The correlation between the ratio of rc' 
sistancc to linguistic assimilation and the percentage of 
Canadians in the electoral distncts of urban Quebec is given by Figure 
The ratio of resistance to linguistic assimilation through bilingualism is 
negatively correlated to the percentage of English people Not even 
Quebec City, which is geographical1> isolated from any large 
settlement escapes this trend Factors other than the number of Enghs 
settlers are, of course, responsible for the correlation upward socia 
mobility, education and wealth in particular Thus the better-cducatc , 
the wealthier the French Canadian, the more likely he is to base 
acquired English and to live m the same suburbs as English Canadians 
To reduce the disturbing efiecl of “New Canadian immigrants” who 
often list English as their sole ofHctal language, we shall restrict the 


♦ One of the consequences of this m»-ard looking and defensise attitude has txea 
that for French Canadian scholars, analysis has by and large meant self analyst 
English Canadians have studied French Canada, but not vice versa, as if ihe 
and the understanding of the dominant group by the minonty involved a <2'®^ 
reversal of psychological attitudes The result »s that v.e are relatively well provideo 
with studies on French Canada As guides through the social saence and histone^ 
literature on Quebec, see m particular Marcel Rioux and Yves Martin Frff’C'*' 
Society Sociological Studies fTownto McOelland & Stewart 1964) ana 
Falardcau (ed ) op at While Ontario receives about 50 per cent of all immigra"** 
to Carada Quebec receives only about 20 per cent According to recent cstima^ 
made by the Quebec Cultural Affairs Mmistty 90 per cent of immigrants to Quel«c 
seek assimilation into the BnUsh rather than into the French group 
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‘ “New Canadian” is used to describe the respondenis who indicated a mother 
tongue other than French or English 

HguTt 3 

Resistance of French Canadians to Lwcuisnc Assimilation 

analysis of ihc ratio of English resistance to assimilation to the electoral 
distncts with less than 5 per cent of "New Canadians” m their popula- 
tion. Although the number of cases remaining ts small, the indication 
IS clearly given by Figure 4 that the ratio of English resistance is much 
higher than the French and hardlyafrcc<edbythesireorthe‘'opponent " 

It IS only when the French Canadians in the district arc more than 90 
per cent of the population that the ratio falls under 0 75, and even then 
It falls only to about 0 50, a level obtained with far less English popula- 
tion pressure in the case of the French 

ETHNORELIGIOUS GROUPS RELATED TO 
POLITICAL PARTIES 

The reinforcement of the major linguistic and religious divisions by 
educational and geographical differences would seem to predispose 
Canada to a communal as well as regional partysysfem, a system where 
the major ethnoreligious commumiies would anchor their claims in 
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Peru. nt3{e claiming I umlt os niollKf lontut in riJint 

• “N«w Canadian" is used to desenbe the respondents who indicated a mother 
tongue other than French or English 

Figure 4 

Resistance of English Canadians to Linguistic Assimilation 
(in ridings with less than 5% New Canadians) 

difTercni parties One might expect there to be a major French and a 
major English party, since it is between these two groups that there is the 
least amount of linguistic, religious, and geographical overlap However, 
despite the growing economic and political conflicts between Quebec 
and Ottawa, the party system, though affected, has maintained, on the 
whole, Its multiethnic characteristics The heightening during the past 
ten years of the Franco-Enghsh opposition has strained the con- 
stitutional and party structures but has not thus far fundamentally 
altered them 

French Canadians have found sufTicient protection in the federal 
system to continue to support federal parties, although the latter arc led 
by and from English Canada, even when a French Canadian is at the 
head Only one of the five parties fails to link at least two of the major 
ethnoreligious groups The Quebec Criditistcs arc exclusively French 
and consequently Catholic 
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W.A. .s .he e.hnore.,s,ou. 

How does each of the major cthn S* answered E\en if we 

,.een these part.es’ The two „r fte nat.on, we 

were to find that a j' ' ^ best political agency of ethnic and 

still could not conclude that it is t 5 ^^ „f 

retigious conciliation unless we know its sire as 
the groups it mirrors 

The hucrml Bhmrehs>o«s ,b"tThe defini- 

At the outset of this ““'^'^J[,“I,1!,^,spurelylinguistic “English" 

tionofethnicitywederivedfrom hesu g or 

and "French" categories refer t^'ha on, describes 

French as their mother tongue 

, hose who had a mother .ongue^^ rton^^,,_,„ 

The degree of under- or ^ f 0 pa„y by a gi'cn ethno 
given by Table 6 The index "f '“>’^,'o„ai percentages For example. 
feligious group is a ratio cf P“‘l' “ " 58 per cent English 

the New Democratic ot„, 0 / English Pre'"'!''’!' '^pp 

Protestant new Canad.aos ( Cathotic column TheND 

servatives have low scores in the 


Table 6 

o^' 

,Ere 

CoinW'***’'* ji— ^ 


Canadian Party 


S CiiiJ I 



French 

TmiTTTn. 

o.hcr /bZ/is; 


Cath I’f®* 


ciropoiiNo 30a Apntiae 


Source 
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,he Socal Creditors are svell balanced bott 

Canadians, svhelher they areCathol.e or Protestant, but ‘h^y 

very weak in their representation of the most important minori y 8 P 

the French Canadians The Liberals are thus the only P^^y “ 

close to being the microcosm of the nation, they underrepr 

the “new Canadian" Protestants, who account for only 6 per cent o 

sample 


The Pohitcal Preference of Ellmoreligious Groups 
The prime importance of the Liberal Party as an agent 
religious aggregation appears esen more clearly from the study o 
balance of political preferences among the ethnoreligious groups c 
sidered(seeTable7) ,nnorl 

Neither ofthe two major parties receives less than ISperccnts PP 

from any of the groups, a level that is rarely reached by the sma ^ 
parties, both major parties can thus qualify as political 
aggregation and conciliation if not integration But again the u f 
Parly is the only one not to be unbalanced, it alone obtains massn 
support from both the French and English communities . 

Ethnic and religious conciliation is thus a characteristic of the poll ica 
center As one moves toward the extremes, the parties take a mor 
specific ethnoreligious form .. 

Schematically we can represent the Canadian party system ^ ^ , 
shaped curve skewed to the right of the ideological center Close to t a 
ideal center we find the point of greatest electoral support, the tra i 
lional” government party and the point of greatest political conciliation 
and amalgamation of the various religious and ethnic communities 


Table? 

Percentage of Support Given by Selected 
Ethnoreligious Groups to Canadian Parties 


Con*er S Credit S Cred t 

NDP Lberab vatiTcs fEng.Canl (Quebec) (roun ded) 


French 1 

3 


1 


3 


H 


3 


100 


En^iih 


I 1 

64 1 

1 

17 

42 

3 

H 

3 

0 

100 


Other ^ 
Unfujj* 

E 

1 1 

50 

l-’l 

iX 1 

40 

n 

18 

0 

0 

100 

100 


Cath. Pro! 

Cath 

Prot 

Cath 

Prot 

Cath 

Prol 

Cath 

PfOl 

Cath 



Source CIPO poll No 302 
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large cities because of the g , E„„ ,f the Conservatives 

in the nonmetropolitan areas ( (Creditistes) m 

The Metropohtm """"'""^“''jj™censu5 data, and m 
To further this analysis ''‘ ”’ "° „„ydefinitionof metropolitan 

order to match as nearly as pass, bleour survey 



•votes obtained ,n 1963 

_ c»nadian Party 

eiioiiaTic UNbtu to ™ 

E™uo«uaious Oaoue .v B*™ 

(.,==vcryitnwigl««.'^*“ 


System Showing 

Protestant 



Preference (Percentages) 
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areas, let us separate the 77 most urban djstncis from the others In 
both areas let us determine for each party the most relevant ethno- 
religious variables by means of a stepwise regression analysis At each 
step the variable added to the equation was that which caused the 
greatest increase m the coefficient of determination in other 

words, the variable that had the hipest partial correlation on the 
dependent variable partialed on the variables already in the equation 
The variables tested were the following (they all refer to the perceniage 
of people described by that variable in each electoral district)* 


Religious 
Anglican 
Baptist 
Lutheran 
United Church 


Ethnic (defined by the nationality of origin) 
British 
French 
German 
Scandinavian 
Ukrainian 

Ethmc (defined by the mother tonjne) 
French 

Other language 


Immigration 
Postwar immigrant 


Education 

Less than five years of formiU education 
University education 


We retained only the variables that produced at least a 2 per cent 
point increase m the il' or had a sismlleanl change m R (»e used a 0 OS 


■■ In the order m .h.ch the, am l«rf hy the Ctnef Ore, oral Mieer-. report the 
meiTopolitan distncis are as fotlow lo jo 5i 22 

Drtnsh Columbia' J. Z 4. 6 , 12. 16 . 7. «. 2». ^ 

Alberta S.b, 7,$.9 Saslaic^^ 75 76.7s 

Ontario- ^ 35^37 3S. J7.5#.59. », 61. ^63.64. 65. 
79. SO, 81,82.83, M. 83 Quebec I9.z7,«,a». ■» . 

66 . 67 , 68 , 69 , 70 . 71 . 72 . 73 . 74 . 75 


Now Scotia 7. 
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s.gn.ncant level m Ihe f test) The variables retained are 
order m which they appeared in the equation for tnetrop 
nonmetropohtan areas studied separately (see Tables 9 to 12) 


Table 9 

Multiple STEP-nv-STEp REGRtssio«i or NDP Electors (iv %) 
ON Percentage of Selfcted Ethnic ano Religious Group 
PER Electoral District* 


Metropolitan Districts {N ~ 77) 


a RcCTCSSion equation (nonstandardtzed) 

% NDP = 36 67% -( 24) (% Catholic) -(1 89) (% university 
education) 

b Regression equation (standardized) *• 

NDP = 3 13 - 62 (Catholic) - 48 (univ cduc) ri,-.liiesat 

c Order in which the variables entered the equation with R and r 
each step 

lststcp«Catholic(r«24 9). 25 ^ . .. 

2nd step *= Catholic (F= 503) + umv cduc (F=® 296 ),a “ 
d Simple correlation of each variable in the equation with party 

Catholic ss - 50, univ cduc = — 40 

Noninetropoltian Districts (A( = J 88) ^ 


a Regression equation (nonstandardized) , 

%NDP«6098%- 58 (% University education) -f 38 (%Anglican/ 
“( 98) (% postwar immigrant) 

b Regression equation (standardized)** - 

NDP » 2 05 - 68 (univ educ)+ 28 (Anglican) - 39 (postwai 
immig) . 

c Order in which the variables entered the equation with Fand F* values 
each step 

1st step = univ cduc (/- = 162 4), F* = 46 

2nd step « univ educ (F = 272 4) + postwar immig (F— ^ 

F* = 60 

3rd step = univ educ (F= 195 1) + postwar immig (F= 77 8) + 
Anglican (F= 35 1), F* = 66 
d Simple correlations of each variable with the party 

Univ educ = — 68, postwar immig = — 14. Anglican — _ 


• Wc stopped entering variables in the equation when the stopped 
by more than 2 or when the Fvalue of the variable to be entered was not signitica 
at the 0 05 level 

•• In the standardized equation the vanables are measured m units of their owi* 
standard deviations 

Source Census data 1961 and elcrtion data 1963 
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Table 10 

Multiple Sirp-nv-STEP Regression of Lideral Electors {in %) 
ON Percentage or Selected Ethnic and Religious Groups * 
PER Euctoral District* 


bfflropohian Dssincft (N - 77) 

o Regression equation {nonslandardized) 

% Liberal = 41 78 — 3 76 Scandinavian) 4 - 3 7 {% unive/sily 
education) — l02(%Gcnn3n) t 22(%olhcr language)** 
b Regression equation (standardized) ••• 

Liberal * I 13 - 68 fScand ) ■» 49 (univ edoc) - 37 (German) + 
25 (other lang ) 

c. Order in which the variables entered the equation «iih /^and R* values at 
each step 

1st step =« Scand (r= 58 5), R* - 44 

2nd step =» Scand (F- 1205) + univ educ (F- 356), R* « 62 
3rd step ** Scand [F = 51 3) + univ educ (F - 35 9 ) German 
R** 65 

4th step = Scand (f' 47 3)4 univ educ (F 451) * German 
(F=r 13 0) + other lang (r 104). 69 

rf Simple coiTclaiions of each variable with the party 

Scand « - 66, univ educ * 40, German - - 58, oihcrlang. 

- 03 


NontnetropeUtan DmrteUiN •» 188) 


G 


b 


Regression equation (nonsiandardizcd) 

% Liberal = 8 91 4 - 87 (% French) - 86 {*/, university education) 
+ 2S2.9 {% Ukrainian) 

Regression equation (standardized),*** 

Liberal* 24 + 90 (French) - 22 (umv educ) f 14 (Ukrainian) 
Order in which the variables entered the equation with F and F* values at 


each step 

1st step = French {F - 1034 7). R* 84 
2nd step - French (F - 1444 7) + univ educ (F 
3rd step = French (F - 1652 0) + univ educ (F » 
(F* 24 6). 91 

Simple correlations of each variable with the party 
French » 92, umv educ 86, Ukrainian w 


96 3). - 89 

97 0) r Ukrainian 


We stopped eocenns >«raWes m ite "I™™ ZmS 
by more ItoiT? or »»eo Ihe f «lue oflM 'trabk <« bo n’lor'J 
at the 005 level 
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Tab/e H 

Multiple Step-by-Step Regbessios of Conservati'.e Electors 
( i'< %) ON Percentage of Seltcted Ettimc and Reugiols 
Groups per Electoral District * 

Metropolitan Districts jS = 'll) 


Regression equation (nonstandardtzed) . , 

% Consen-ame-559-r 84 (% United Church)-' 96 (/, Jow 
education)** - 2 95 (% Scandinavian) - 88 (% Gcnnan) 

Regression equation (standardized) **• j At 

Conservatne = 54 J- 1 02 (United Church) - 41 (low educ) 

(Scand ) - 37 (German) , 

• Order m uhich the variables entered the equation with F and a 
at each step 

1st step = United Church (f - 30 5), R* *= 29 
2nd step = United Church (f - 390) - lowcduc.(/’« 9J), a _ 

3rd step « United Church (F * 60 6) - low educ (F = 

Scand (F= 15 8), 48 

4ih step* United Church (F=351 )t1cw educ. 

Scand (F*24 7)-German(F*89),R»* 54 
d Simple correlations of each vanable with the part> 

United Church * .54, low educ = - 19, Scand * 19* Gcnnan - 
JO 


= J7 
r 17J) - 


h'onmetropohtan Districts (^' = 1 88) — 

a Regression equation (nonsundardized) . . 

% Conwrvative = 10 S9 + J8 (% postwar immigrant) — 2.73 (4 " 
bom m Canada) ^ 2009 (% Ukrainian) - OS (% French) - *3 L/* 
Anglican) — 16 (% Baptist) 

b Regression equation (standardized) ••• ^ 

Cons = 92 — J8 (postwar immig.) — .24 (not bom Can ) — -34 
(Ukrainian) — .26 (French) — .24 (Anglican) — .22 (Baptist) 
c Order in which the vanables entered the equation with Fand R* values a 
each step 

1st step - postwar immtg.(F= 79 J), R* = .29 „ 

2nd step = postwar immig. (F= 82.9) — not bom Can. (F=* 47 2), 
R*= 43 

3rd step = postwar immig, (F = 75 2) not bom Can (F =" 68 7) 
Ukrainian (F = 44 9), =* 54 

4th step = postwar immig, (F = 72.4) ^ not bom Can (F = 39 8)- 

Ukrainian(F-41.5)+ Frcnch(F= 161),R* = 58 

5th step = postwar immig. (F = 82.8) - not bom Can. (F = 24 7) 

(F = 44.2) - French (F = 26 4) -r Anglican (F= U.5), 

6th step = postwar immig. (F = 66 9) + not bom Can (F =218)- 
Ukiainian (F= 347) - French (F= 23.5) -f Anglican (F= 21 D ^ 
Baptist (F= 18 0), R* = 64 
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, S.mplecorre.a.,o„,or=achv.nabk-.hte>^^^^^ ^ 3,, 

Cm ik;» j?* cfoDPcd incrcasins 


bymoreuiu.i - 

^'so„“S‘‘S«s"dala .961 3ndd«.»nd..a .96) 

Table 12 

MULTlFUi STIPBV-SnF AWRH-.=>™S 

,, 0, 


their O'An 


.St - 56. French 77:77^ 

Bap..so a- 3 (•/. " 

V Social Creditor “ 3 (V CaUtolid 

C Order in which ihe « 
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Table 12 (continued) 


d Simple correlation of variable with party ^ 

Baptist ^ 45, postwar immig — 39, Anglican - 
- 36 


03, Catholic = 


Metropolitan Districts in Quebec (A — 25) 


a Regression equation (nonstandardized) 

% Social Creditor = - 8 74 4- 40 (% French) 
b Regression equation (standardized) •* 

% Social Creditor - 72 -r 79 (French) 
c andd f value = 40 5, /i* - 63, simple co rrelation (r) - u 

Nonmetropohtan Districts in Quebec {N = 50) 


a Regression equation (nonstandardized) 

% Social Creditor =- 94 + 47 (% French) 
b Regression equation (standardized) •• 

Social Creditor = 67 + 44 (French) 

c an d d Fvalue = 12 1 , = 20, r = 44 ^ 

• We stopped entering sanablcs in the equation when the K* stopped 
by more than 2 or when the Tvaluc of the variable to be entered was not sig 
at the 0 05 level r their own 

•• In the standardized equation the variables are measured m units ot me 
standard deviations 

Source Census data 1961 and election data 1963 


On the whole, the regression analysis confirms the survey findings 
The NDP s success is negatively correlated in the metropolitan centers 
to Catholicism and to university education A regional analysis 
that the negative correlation with Catholics remains in Ontario n 
disappears in British Columbia and Alberta, a qualification that a so 
confirms the survey data (See the appendix to this chapter ) 

In the metropolitan centers the Liberals are positively correlated wi 
uruversity education but negatively correlated with Protestant ne'V 
Canadian” ethnic groups (the Scandinavians appear first in the equation) 
In the nonmetropolitan areas Liberals are highly correlated with the 
percentage of French people 

The association of Conservatives with Protestants and nev/ 
Canadians” is confirmed The equation indicates in particular that in 
the rural areas the Ukrainians a non Protestant ‘‘New Canadian 
group, arc positively correlated with that party 
The variable “French” alone explains 63 per cent of the Socia 
Creditistes variance m the urban areas of Quebec and 20 per cent m the 
rural and small town areas, this indicates that the support for Socia 
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Credit in the metropolitan centers wts among French Canadians more 
e\enly distributed geographically than in (he rest of the pro\ince " 

The fact tint m the ttxstem English Canadian provinces Baptists 
enter the Social Credit equation first, in metropolitan as well as non 
metropolitan areas, confirms the link between religious fundamentalism 
and the western brand of Social Credit The negative correlation of 
Social Creditors with Catholics is a further indication of the homo- 
geneous Protestant character of that party m the West, at least m the 
rural areas 

Among the correlations presented m Table 11 one claims special 
attention because it was not anticipated Where we had expected 
“Anglicans" to score m the Conservative column, “United Church" 
appeared We hypothesized that this might have been due to the weight 
of Western Canada and (hat if \vc were to analyze separately Ontario 
and the West, we would find Anglicans a better predictor of a Con 
servative vote m Ontario and United Church m the West The hypothesis 
was not substantiated The correlations with Anglican' were not 
significant in the West they became positive m Ontario buiremamcd 
insignificant The United Church on ihe contrary had a high positive 
correlation in Ontario ( 63) Maybe the icrntonal distribution of the 
two religions is such as to tempt us to accept an ecological fallacy ’ 

We feel reluctant to accept the suggestion that United Church member- 
ship is a better predictor of a Conservative vote than is Anglican Church 
membership But it may well be that the folklore is out of date on this 
point Twoofthrccsmall scalcsurvcysmadeinreccniyearshaveshown 


For an ecological sludy of ihc Social Cred t »oic j» Quebec in 1962 and J963 
see Vinccnl Lcmieut L<» dimensions sociolog quts du vote cnfdmsie au Quebec 
Recherchet Soewgraphques 6 2 <»M5) pf» J8t 195 The Social Cfcdil suewss 
appears linked to ihe size of ihe communiiy its votes being Ihe highest in com 
mumlies of I 000 lo 5 000 lohabilanis See also M Pioard Polecat Factors in 
Ihe Rise of Ihe Social Cred i m Quebec (paper presented (o ihM9M mecimg of the 
Canadian Politicat Science Aaoc«lion) and V Lemeut The Elcciiws m the 
ConstiluencyofLeiis mJ Meisel (cd) Fo/wm on fAe /MZ £yecfwn fToronio 
University ofToronlo Press 1964) t ^ ^ , • 

'■ Nor should our bo loiupltd lo atoopl wiihoo! cauf.on Itic h.,h oorrtlalions 
obtained m our nnatjsis which -as mlendcd simply to idem Ty Ihe major variables. 
As Rohineoo oomled CTJI ccoIOEical correlations lend lo esaggerale the correlations 
^blmSoTSSn derogmph,/™n.bta at Ihe level of mtoidnal, W S 
Robinson Soc cal Correlalioni and Ihe Behavior oT Individuals " 

Kooinson tco £ .,0,0501 «« J5i 357 For suggesled concctiom of ihis bias 
Rnresswns and the Behavior of Jndiuduals,- 
T A Goldman ^ a dscussion and 

ap^^Sion rA^R Raggally 

19W A»encanPof<tica/Sa(^cfRf»^ 36(1962) pp 66-70 
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a sl.ghtly greater support of the Conservafyc party by ^ 

Church than by Anglicans John Meisels study "> 

and that of Grace Anderson m Hamilton (also in 0"“'''°' 

cases the authors felt reluctant to generalize from local observatio , 

It is possible that the local situations evidenced an evolution " 
affected not only the Conservatives but the Canadian party y 

generally — a movement of the Anglicans toward the center 

Should further research establish that the traditional link 
Conservatives and Anglicans has been replaced, even in Ontario, y 
preferred link between United Church and Conservatives, we 
have to revise our belhshaped model of the Canadian party 
Close to the ideological center, which cuts through the Libera ar > 
we would locate the ideal point of better representation o 
“historical” churches, the Anglicans and the Catholics, moving . 
the extremes, we would then see a greater saliance of the more rece 
denominations Admittedly, the Quebec Credilistes would not nt wi » 
such a model But in Quebec as m France, the Poujadists are no c 
pcctcd to last Should they last, however, or should French Cana 
develop other specifically French parties, it might become increasing y 
difficult to use the same model of explanation for the link 
ethnoreligious groups and political parties m both Quebec and Eng is 
Canada . 

Our hesitation to accept the suggestions offered by the ecologies 
analysis, or our reluctance to explain the individual by the group 
so doing violates the folklore, is evidence of the serious limitations oft e 
method used But it could be argued also that, had we not tried t is 
method, unsatisfactory as it is, wc might have not met the problem at a 


The expected high correlation between Anglicans and Conservatives is reported 
in a systematic sample survey made by P Regenstrcif, Some Aspects of National 
Party Support in Canada Canadian Journal of Economics and Political Saen 

ICJEPS) 29(1963) pp 59 74 higlier support by the United Church is reported i 

John Mciscl, • Religious Affiliation and Electoral Behavior, CJEPS 22 (1956) P 
486 and in Grace M Anderson * Voting Behavior and the Ethnic religious Van 

J2(1966) pp 29-37 For a discussion of the religious factor in I 

1957-1963 ele«ions sec P Regenslreif. The Dlefenbaker Interlude (Toronto tong 
^ns. 1965) ChapicrV For a historical analysis see S D Click, Church and Seci 
In Canada (Toronto Toronto Uiuversity Press 1948) 
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English Prot 
Engh&h Calh 
French Calh 
0-L Prot 
0-L Calh 
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DAVID R SBGAL AND MARSHALL W MEVER 


The Social Context of 
Political Partisanship 


CetW'ECN the INDIVIDUAE and the socielyof which he is a mem her, there 
exjstsamuUitudtfof primary and secondarysocial groupings that define 
his place in the social order and demand certain behaviors of him 
Thus conceived, primary and secondary group memberships at once 
condition a person's relationships with others — alTecting the choice of 
his associates and the roles he takes — and at the same time exert a 
normative influence by requiring certain attitudes and actions and 
proscribing others These mediating groups serve several integrative 
functions forthc individual and society First, theymtegrate theindivid* 
ual into society fay placing constraints on his behavior Durkheim 
showed, for example, that members of cohesive mediating groups — 
such as married people and members of the Catholic Church — were 
less susceptible to suicidal currents at the societal level than nonmembers 
or members of more loosely knit collectivities — unmarried individuals 
and Protestants ‘ Second, mediating groups with overlapping member- 
ships may sene to integrate society as a whole by limiting cleavages 
along class, sectional, or ideological lines Simmel, Coser, and Coleman 
have explored this issue in their dianissions of primary, formal organ* 
izational, and community structures ^ 


* Emife Durklwim Suicii/e (Gtcncoc Free Press 1951) p 145 

* Georg Simmcl. Conflict and the tt cb of Gnwp AffihanonHGkncoe Free Press 
l955),LcwisA Coser The FunctionsafSoeitdCoi^HCjlensxie FreePress 1956), 
James Coleman. ComniKnny Conflict (Clencoe Free Press, 1957) 
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KORNHAUSER and suits ON POLITICAL PARTICIPATION 

William Konihauser has suggested that a "''^“,',^1 

otganizations may be necessary for the , h, 

integration of society Where such “f 
argues, the public is available to manipulation by 
of mass apathy and lack of participation exists Such co 
definitive of mass society in Kornhauser s formulation 
dilTerent view of mass society is suggested by Shils, who 
society can provide the conditions for increased politica ^ ^ 

by diversifying the potential bases for political participa 
present paper suggests that each theory has its clement 
We propose that the juxtaposition of Shils’s notion of . 
bases of involvement with Komhauser’s concept o 
organizations presents a more realistic and less 1^,5 

mass society than Kornhauser suggests, while not re u i 
basic hypothesis . -u„ 

In terms of Kornhauser’s formulation, a strong intermedia B ^ ^ 
structure is necessary to preserve pluralism in^ society Lacking 
strong intermediate structure, the individual becomes ^®*P®^** 
manipulation by the elite through political organization and t e 

media nnzs 

Kornhauser conceives of mtcrmcdiatc structures for the most p 
voluntary organizations Differentiation of the labor force, he 
IS such that union and professional groups are now needed to 
integrative function formerly accomplished by informal work asso 
tions In the same manner, the complexity of the modern communi y 
such that only organized and vocal interest groups can pre 
accessibility of nonehtes to elites* In more operational , 

interpret Kornhauser as saying that individuals who lack attachmen 
voluntary organizations are open to political control and domination y 
the national political elite The unattached person is viewed as root es , 
devoid of lies to any political community, and of transient and incon 
sistcnt beliefs Kornhauser, then portrays the unattached person as 
the apolitical man 

It IS our contention that this is not the case While we do not mean to 
refute Kornhauser’s basic hypothesis that the absence of mediating 


’ W.Uiam Kornhauser, The Pohites of Mass Society (Glencoe Free Press 
pp 60-73 

* Ed«rd A Shils “A Theory of Mass Society, Diogenes 39 

* Kornhauser op elt pp 76-77 
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associations leads to apathj. ive reject his Jimitcd operational definition 
^ mediating association and argtic that forms of social retaiionslup 
besides voluniiry organtaaiions may serve this mediating function 


OWER THEORfES OE PARTICIPATION 

It has been demonstrated that diverse kinds of nonpohtical organiza 
tions can serve to integrate (he individual into the political community ® 
Several studies have indicated the importance of primary group rela 
tionships in (he formation of political aliiiudes Tliesc findings show that 
people m large measure pattern their political beliefs after those of their 
friends, family, and coworkers ’ 

A major preoccupation of research into American political behavior 
has been the relationship of social class and political partisanship 
Surveys of national samples of the elcctoraie have consistently shown 
that, although there arc regional differences people of high socioeco 
nomic status (end to support (he Republican Party and persons of low 
socioeconomic status tend to support the Democratic Party ’Similarly 
people who perceive themselves as middle (or upper) class tend to call 
themselves Republicans in much greater proportion than (hose who 
identify with the working (or lower) class ’ We do not intend to discuss 
at length the connection between social class and political choice It 
Will suflice to note (hat cither self interest or class interest may explain 
the relationship between low socioeconomic class standing and prefer 
ence for ‘ left ’ political parties as well as its converse the relationship 
between high social class position and right political preference 
Hicse studies drawn from national samples have identified a number of 
individual level correlates of political choice — for instance, age and 
religion But national surveys by drawing from a very dispersed sample 
of respondents (a few respondents at each of the great number of 
sampling points) have been unable to account systematically for the 
effects of the social context in which a person finds himself, his immediate 
environment, on his political choices The effects of local community 
structure or locally based formal orgamtations cannot be deduced 


- See Scott Or«r and Peter Orleans TUe Mia Sociely and tht Parapol t cal 
Siruciure American Sociological Ketiey 27 (1962) PP ^ •, MrPhw' 
"Sec for example Bernard Benrison PaulP and William N McPhec 

^'•r“ hS rn-Mon U"'««r 

siiy Press 1949) 
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from data .haf .gnorc fhesc var,abl« Other studies 
have concentrated in single communities While such des E 
analysis of the ctrccts of organizational membership, J 

comparisons have no. been possible Thus, 'h' 
hypothesize a “breakage cfTect” - that is. all other of 

people will tend to vote for the party supported >’> 'h' 
opinion of the communities in which they live But the f 
elfeet is observed in Elmira, Nevs York, docs not allow us to gen 
It to the whole of American politics . o 

Another approach has recciscd a great amount of attent 
Study of voting behavior in the United States In the a sence j 
vidual data on social status, regional aggregate statistics hate « 
to describe characteristics of \olcrs " While v.c recognize e ^ 
comings of inferences from ecological correlations, two bene i sc 
derived from these studies First, they take into account t ® 
context of partisanship on a comparaine basis Second, they 
strate that a strong relationship exists between the social s a 
aggregate groups and political partisanship 


NEIGHBORHOOD AND INDIVIDUAL SOCIOECONOMIC 
INFLUENCES 

It IS our contention that such findings arc meaningful though 
not assume that the characteristics of the community reflect the c 
acterislics of any given individual m that area We hypothesize that 
local community can serve as an important means of political orien 
tion for a person, regardless of whether or not he shares the 
status of his neighbors That is, wc arc suggesting that the communi y 
IS one of the bases of political diversity discussed by Shils and is there 
fore a functional alternative to voluntary association membership 
terms of Komhauser’s model 

Our data are drawn from a study conducted in 1963 by the Bureau 
of Applied Social Research m nine towns in northeastern United 
A total of 1,382 respondents were interviewed on a block probabi i y 
basis For purposes of our analysis, 127 respondents have been omitie 
because wc could not judge their socioeconomic status We have 


Bcrclson ei at op elt p 98 

“See for example. V O Key St , Southern PoUua in State and Nauon 

YorV Knopf 1949) 

•*W S Robinson ‘ Ccologral Correlations and the Behavior of Individual* 

American Sociological Review J5 (I9SO) pp 585-595 
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denned -'Sl-borhoods m ^ ^ 

though we rccognaed the gy 

hood may dtfTer front our „a,„s (SES) of re- 

thc mean score on our indes for whom we could 

spondenls lomg m it, t |„|jex of neighborhood SES. 

"°\'“7a“me«uTor'n",Ehb;.lod homogeneity, this is the standard 
rvlrn^TsELcores within ^bc—^ 

Table I shows the “Among low-SES respondents, 

neighborhood SES, and Pttrty cbt»“ ^ J nent in high-status 

27 per cent in '“''f TeS 

communities consider "t""'' ,e,po„denls, 40 per cent in low-SES 
centage points Among high-SES P neighborhoods choose 

nSorhoods and 66 per “I" “ ^f“r,'dVerele Not only does 

Table I 

PatcFMTAOF “’iMp'sEr _ 



neighborhood ,0 a local climate of 

tpimonthatinnuen ^^l^^^^gjneousstatM ^p„hoods Thus. 
Jshould predict that 
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fromda,a.h.,,Enore.hese^ar.bles0.hcrs^d,esofpol.^^^^^^ 

have concentrated ,n single communities While ,y 

analysis of the effects of organizational membership, cross rn f 
comparisons have not been possible Thus, the horsj^u''^? 
hypothesize a “breakage effect" — that is. all other th ^ ^ 

people Mill tend to vote for the party supported by the cfim 
opinion of the communities in which they five But the ft 
effect IS observed in Elmira, New York, docs notallow us to g 
It to the whole of American politics ,n the 

Another approach has received a great amount of a 
study of voting behavior in the United Stales In the a 
vidual data on social status, regional aggregate statistics have 
to describe characteristics of voters *' While VrC recognize !. ® ^ 

comings of inferences from ecological correlations, two bene i s ^ 
derived from these studies First, they take into account t ® 
context of partisanship on a comparaine basts Second, they c 
strate that a strong relationship exists between the social s a 
aggregate groups and political partisanship 


NEIGHBORHOOD AND INDIVIDUAL SOCIOECONOMIC 
INFLUENCES 

It IS our contention that such findings arc meaningful thou^ " 
not assume that the characteristics of the community reflect the c 
actenstics of any given individual m that area We hypothesize tha 
local community can serve as an important means of political orien 
tion for a person, regardless of whether or not he shares the so 
status of his neighbors That is, we are suggesting that the communi 
IS one of the bases of political diversity discussed by Shils and is t e 
fore a functional alternative to voluntary association membership * 
terms of Kornhauser’s model 

Our data are drawn from a study conducted in 1963 by the Burca 
of Applied Social Research in nine towns in northeastern United 
A total of 1,382 respondents were interviewed on a block probabi i Y 
basts For purposes of our analysis, 127 respondents have been omittc 
because we could not judge their socioeconomic status We have 


*^Bcre\wnetat.op at p 98 

“ See, for ocamplc. V O Key. Jr, Southern Politics In State and Italian (* 
York Knopf 1949) „ 

‘*W S Robinson. “Ecological Ctorrelations and the Behavior of Individuals. 
American Sociological Rerlew 75(1930). pp 585-595 
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hypothesis, for it requires that the effect of community structure on 
partisanship be shown to be spurious when we control for organiza 
tional participation We wish to challenge this hypothesis and hc shall 
present the appropriate data presenlly A third possible explanation is 
that community socioeconomic structure is reflected in primary group 
associations Let us assume for a moment that a person’s primary 
associations in the local community are selected more or less randomly 
This may not be true, but it is a useful working assumption Let us also 
assume that persons of tow SES are predisposed to be Democrats and 
individuals of high SCS are predisposed to be Republicans It is 
obvious that, if one accepts assumption of random association, the 
mean SES of the primary groups is the mean SES of the neighborhood 
in which (hey are formed But when we ask which political party has a 
majority in these primary groups formed by random assoastian, aa 
interesting pattern emerges By calculating binomial probability 
distributions, we can sec (hat tn an area where there is a majority of 
either high or low status people, and thus by assumption a preponder- 
ance of either Republican or Democratic sentiment, the likelihood that 
primary groups will be dominated by members of the majority party 
IS higher than the proportion of adherents lo that party m Ihe com 
mumty For instance, Table 3 shows that if 60 per cent of the people 
in a neighborhood are of high status and arc thus predisposed to be 
Republicans, 65 per cent of the three person primary groups and 68 per 
cent of the five person groups will have a majority of Republicans, if 
we assume random association In other words, the majority sentiment 
m the coromunily is accentuated mpnmary groups Social psychological 
theory tells us that (here exist strong pressures toward similarity of 
attitudes in small groups, this notion and the calculation of some 


Toble 3 

PROOADIUrY THAT PWAIAKY GMUPS Witt HaW RETUBUCAN MAIOJirV 
(Percentacc) 


Three person group 
Rve person group 


Proportvm of High SES Persons* In 
neighborhood 


20 30 40 SO 60 70 SO 


JO 22 
6 16 


35 

32 


50 

50 


65 

68 


78 90 

84 94 


All assumed lo be predisposed to Kepubtfcans 
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those hving in more homogeneous areas tend J’' 
m greater proportion than those m heterogeneous 
the same manner, there should be proportionally mor ^ 

homogeneous than in heterogeneous high-status “mmu J 
shows that this IS, in fact, the ease for both low- 
respondents, although we cannot make a definitive conclus 
how SES affects individuals in high-SES areas for lack o 
respondents on which to base a reliable percentage 

So far, we have established that neighborhood structur 
significantly influence political behavior We can thus ^ „ 

the local community as one of the several bases of politica 
the United States Such a notion provides an adequate desenp 
what IS seen in our data, but it docs not provide an 
processes at work there One possible explanation is ^j,ly 

structure, as measured by our variable of neighborhood S . * 
an index of the extent to which the organization of one which 
other has penetrated the community We do not have data wi 
to test this possibility, but Putnam's study has indicated t a 
organization in the community has little influence on pohlica c 
when other variables are taken into account ** A second possi i 
that neighborhood structure is reflected in the kinds of formal a« 
tions into which a person enters and that what we observe as an c 
neighborhood is but an artifact of voluntary organization mem r 
patterns This explanation is, in fact, equivalent to Korn aus 


Table 2 

Percentage Republican by Respondent’s SES, 
Neighborhood SES, and Neighborhood Homogeneity 




Respondent s SES 

Low 

Homo- 

geneous 

Neighborhood SES* 

Low High 

Heiero- Hetero- 
geneous geneous 

Hub 

Homo' 

geneous 

Low 

Total number 
High 

Total number 

25% 

(276) 

33% 

(79) 

32% 

084) 

44% 

(115) 

48% 

(93) 

63% 

(125) 

41% 

(17) 

71% 

(80) 


• In order of hypothesized increasing Republicanism 

» ° Pumam. ‘ Pohtica! AlUtudes and the Local Community,'* American 

Polillcal Science Rerlew <50 3 (1966). pp 640-654 
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the effect of community structure on 
hypothecs, for .t nnouA when »e control for organ, ra- 

partisanship be shown to be ‘P hypothesis, and we shall 

tional parlieipation Wa™* ^ third possible explanation is 
present the appropriate data P“'" J primary group 

that community socioeconomic s'™ s„„’s primary 

associations Let us assume '“ “jX ed 0,010^1 less randomly 
associations in the local assumption Let us also 
Thts may not be true, but it is a us f ' ^ Democrats and 

assume that persons of low a P be Republicans It is 

individuals of lush SES ore P™ P j ,g„dom association, the 
obvious that. If one accepts sES of Ihe neighborhood 

mean SES of the primary e™“P' “ ‘ „hieh political party has a 

m which they arc formed ^ by random association, an 

mtirestins pattern emerges By a ma on y of 

fs'C/^SbcproP^-ofadbe^^-p^ 

munity For instance. «iatus and arc thus p ccner 

in a eent°lf the three if 

Republicans, 65 per ee 3 majority 01 p „„,im 5 nt 

cent of "'?“«soSrn™ other words, .he ™a^0nvse^ 

ne assume random ^, 3 p„moDg™"P ^'^,„„larity of 

attitudes in small gr J 

OnoSlEH-REFUaucANMAlonrrv 


Probability that pw^^AR^ (Percentage) 



• All assumed to be 
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towns wnhont a dear political tradition or 

neighborhood SES, then, are somewhat m.ti^ted ^ 

at the town level, the local community most 

tisanship of its members where neither party ^ a, ne 

allegiance of the town In this sense, strong be 

town level and the same sentiments at the neighborhoo 
seen as functional substitutes for each other 


THE EFFECTS OF RELIGION 

In the same manner as we have examined the cHects -i 

climate on the relationship between neighborhood , „oi,5 

san choice, v-c should investigate how membership m ^^jon’s 
community impinges on this relationship For our purposes, a ^ 
religion rather than his religiosity is most important to us 
assume, on the basis of Lcnski’s findings, that a person s ass 
with his coreligionists rather than his formal ties with ^ ^ ^ 
most important in forming his political sentiments * V/c s ® - 

assume Durkheim's conclusion that membep of the Catholic 
and Jews are more integrated as groups than are members of Pro 
denominations Following Durkheim’s logic again, we should p 
that members of more cohesive religious groups — Catholics 
Jews — are most immune to political currents in the local 
An additional faaor impinging upon this relationship is that t c 
and Catholics have more deeply entrenched political cultures th^ 
the more socially heterogeneous Protestant group Both 5 

Jewish traditions favor support of the Democratic Party ** ^ 

shows how neighborhood SES affects party preference when we 0 
constant the respondent’s religion The data clearly show how 
background is a strong determinant of political choice Regardless 
their mAn SES and regardless of their neighbor’s SES, Protestants ^ 
consistently Republicans m greater proportion than are Catholics o 
Jews But our mam concern 15 wiih the effect of neighborhood compoi^ 
tion when we hold constant the religious factor, and the summary 
percentage point differences at the bottom of the table show a significan 


rar/or (New York DouWeday Anchor, 19^*1 

TT " 

I. Raid McNally 


Bthaiuyr of American Jews (Gkncoc rr« 
J>'rfop-nent and Pm.s'cncc of Eihn*: 
American Fo} ikol Science S9 11965) pp 896-5W 
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Tables 


Son Proiestani 
Seigbborhood SES 
Loy "«'■ 


Protestant 
Seiglibo^l’ood SES 

LOH fl‘Sb 


Respondent’s SES 

Low 

Total number 
High 

Total number 
Effect of neighbor- 
hood, m percent- 
age points 


16 % 


36 % 


36 % 


27 % 


( 196 ) 

’( 92 ) 


( 61 ) 

% 

( 123 ) 


( 261 ) 

% 

( 99 ) 


61 % 


( 49 ) 


85 % 


( 149 ) 
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pent diflerencP for non-Protostnnts - a 
that obtained for party affiliation - and “ ^5 ^rcentag P 
diffierence among Protestants Protestants, then, 
tivc to the politics of their neighbors Stated in mor E ,^^5, 

can say that members of more cohesive religion g 
affected by political pressures at the neighborhood 
elements of such cohesion is subscription to a commo 
cultural values, one of which may dictate a preferenc 
another political party, as was the case with Catholics an 
bership in a cohesive religious group, then, can scr^e in p 
of orientation to the larger political order ^ -mice that 

We shall now turn to an examination of Komhauser s pr 
the proliferation of intermediate voluntary °[S!tnizatioi«^s 


for the maintenance of pluralistic soaety Just as Kom 
argued that the presence of such organizations is r-ynd 

political integration of society as a whole, other analysts ® 
that membership in such voluntary organizations helps m 
individual into the political order Thus, Almond and Verba, co 
for both education and the respondent’s nationality, have luu 
members of voluntary orgamzations fee! a greater sense o p 
competence than do nonmembers We do not wish to chal eng 
finding, and, in fact, we cannot do so within the scope of our ® ^ 
wish to show only that m the absence of participation m vo 
orgamzations the neighborhood or local community strongly m u 
the political behavior of its residents and thus renders them lessavai 
to manipulation by a national political elite 

An index of organizational partiapalion was constructed ^ ^ 
bimng the number of organizations of which the respondent 
member Viith the number in which he took an active role If we ass 
that the greater an individual’s participation m organizations, ^ 
integrated he is m groups outside of his local community, we sho ^ 
find that organizational participation diminishes the effects of netg 
hood structure on party preference Table 7 demonstrates how 
tional participation is related to the effects of neighborhood SEb 
party choiir The table shows, first, that people who parlicipa « 
voluntary organizations are more likely to be Republicans than otn^^^ 
who do not participate, regardless of their own soaoeconomic sta 
or neighborhood SES This finding has been clearly document 


*’GabneJ Almond and Sidney Verba, The Cmc Culture (Pnncclon 
University Press 1961) Chapter X. 


pnecetoo 
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elsevhere Rnd we need not '^^or the pomt m table also 
indicated by the summary percentage point “ o„aiii- 

that the effects of neighborhood on partisan ‘^''O-ce decrease ^ g^ 
zational participation increases Among respondents wh 
no organizations, neighborhood class composition J, 

averageof25percentaEepointsdin'erencemproportionso^^^^^ 

Among those whose participation is low the margina y 
neighborhood SES produces a 22 percentage 
among high participators — those most committed to formal, 
organizations — neighborhood class structure results in on y 
age points difference in proportion of Republicans The gr 
extent to which a person is involved in voluntary organizations, 
sensiti\e he is to local neighborhood pressures in the forma ion 
political attitudes This is merely a special case of the more gc 
proposition that we base now established the more a 
tegrated into the political order by secondary group ties outsi 
local community — attachments to the sentiments of his 
coreligionists, and to voluntary organizations — the less the lo^ 
munity in v-htch he lives influences his party choice Conversely* 
v.ho are least integrated into other secondary groups are most sw > 
by partisan currents at the neighborhood level 


SUMMARY OF FINDINGS 

A short recapitulation of our findings may be in order here We 
first, that neighborhood socioeconomic status exerts a pov.erfu c 
on people's political party choices This effect of neighborhood ^ ' 

in fact, greater than that of a person’s own socioeconomic class stan i 
Wc find, second, that the magnitude of the effect of neighborhood c 
composition on party choice is inversely proportional to the exten 
which a person is integrated into the political structure by ot ^ 
secondary groups Political pressures in the local community can 
said to operate as functional substitutes for other means of integrate ^ 

the individual into the political order U may be argued that those o<^ 

political parly organizations arc strongest which best represent t ^ 
economic interests of the majority of residents in a community, 
that because of their superior organization, they may influence mem ^ 
of the community who represent minority economic interests 
of reasoning is questionable in the light of Levin’s finding regarding 


*• Bcrelwn ei cl op cit . p 52 
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school students are inlluenced by . |.„,s i» Surely, it isould 

hoods regardless of the party c oi mlluencc adolescents 

be difficult to -P'-”, Pf;„rrs «plana.ion, then, must be 

without mnuenemg tlmir P='™® ngsaretheresultot social 

rejected It might also , however, when mobility is 

mobility The community effects persist, 

^rSadosuggestthattlmmamd^ 

but rather in terms ““"“““Lhob for mass society theories We 
These findings bear formal voluntary organi- 

challengc Kornhauser s contention th y „„,ead 

X can render nonehtes '"'.'“l 

that many forms of association, bu j ,„io the political 

ZZl can serve “''a XT 

structure and thus provide bases to contend that the im 

Kornhauser-s model of PP'"'“‘ ’’'“'“'""’s'Tffier 

portance ofthe "'‘S'’‘’'’'';°„,nt^he political order by means oo her 
L an tndividuars ':X Mhough Wilensky s'" “ 
associations becomes more tenuou^^,^ onhe^lo« 

tion, which our ^ L ,„i|,vidoal into mass socie y,^^^^^ 

community m integrati ^ ^(h his assumption that . gening. 

selves generally m agreement «th n hips ate nP‘''e““ J, 

:Xded.ntrad,t,ona ormsof^ 

although the quality on^htociety „r •totahtanen have 

of the discussion undcrestim , . YJ^lable,’ 

sistence and *“"’''“>',“1 Xntty been kept •atom.red, 
nowhere m any dev ^ powerful 

■unattached,- 'in motion .immunity ese 

The fact that parti^n fc ew« ,o radical 

an mlluence on a 'Africans may not be. n 
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, deolog.es or sudden shTts .n party 

suasion cannot overcome the way a person s social se ting 
his political beliefs Our data add to the j>lteajiy 
evidence that people do not make their political choices ’ ^ 
matter, any other kinds of decisions — as isolated, atomize 
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/,„„d /,„«, ca.,al ,hc man »/ .l.^« P;“f ,J 

4„s . van^y of ,.s^,Jiccn: oocs fir ,hror.z,na end nUmn firM 3 J 
alurnative interprelaiions . . Pr^n^ //£««' 

In bis pa, ns:aking analysis oj data for the wra ° ^ J„ bis earUn- 

Doaan has also underscored the need for mahodologica fexi ) ^ 

n-nrk hr prnrrM drrcpl.rdj through the "" »/ jj, 
rourrrr ofrtrrngth fir rorh party hy rtrurnuag rtitrrnnt.rr IrvrU ^ 
rupportfir thrCoLtumrt and thr Sortulut purtirr, hr war ahU to gmrr ^ hg/^ 
ruggrsttrr hypothrrrr about thr rourrrr of rrgtooul '“21 paprr 

lotrr work hr hot rrplorrd a hroad rongr of rtattrttral prorrdurrs 'f' 
this \olume ojers detailed results oj an attempt at an anajsis oj 
Thr txtruordioory nclintii of thr Frrorh data hat not yrt rm rx 
It IS clear already that it is not possible to get much further m F 

of the total rartance on the basts of aggregate data alone To pw J 
toward the explanation of the residual variance, will no doubt e n 
organize ecologically controlled sample sun^s in each of the pohtiea ) 
regions of France on attempt in this direction is alreadj under noj an 
add further to the model of explanation suggested through the ona/sis J 
aggregate data available 

The French and the Italian political ecologists have gone further t an 
else in Europe and hate narked cut models if analyses nell worth lertintf 
other countries A great deal can still be done in Ffonfc ond Italy on t e 
backlogs ^historical ccolo^icof data, and it n not inconceivable that a 
attack an the detehpmenl of typologies if early pohneoJ constellations in ^ 
locality will pay of e>cn in the explanation if post 1945 variances t 
subject to which we shall return in Part Fire 


* See especially his chapter on France and Iialy m S M Lipset and S 
(cds) Party Systems and tarer A/ignmenls (Sevr York Free Pr»s 
129 195 

*Cf the very interesting Italian compansons of continuities m voting strenS 
from 1919 to 1946. in Capecchi op c/r.pp 21 52. 
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Determinants 

A Linear 


of Voting Behaiior m Italy: 
Causal Model of Anlyasis* 


When one conHders the 

t„hr«Ua.nUroducedf«.oN^^^^^^^^ 

vanablc dependent on other f t both elect ^ J 

z\td'’o;‘L:-;^— 

As a consequence o consiructcd fron^ wnsis of 

these indices of , gj constructed on the 

communes, with ‘>'1'" ‘n^'“ in r'h.eh such data n 

dataThesedatahavebeenused. 

. The reseanrh^or ?Sf 

Moana »nh a T;"f“tS l«W«and..t»sedo „„„„ 

piniopation co'crs I > y««_fl’provinccs.ancl» r,vjt,2ia and 

provinaal level T.lvdi'idfd«»*o*^^„c-i of Aosta snah'Z'td 

£rs5E£-"«'tt^— 

VOS 7,144 
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at the commune level 
divided into two mam 
tions at the provincial 
commune level 


The analysis of this material, 

parts In the first, we shall examine he com\ 

level, and m the second, the correlation 


CORRELATION AT THE PROVINCIAL LEVEL 

Eighty^eight territorial units ixerc used for the 
indices, four Italian provinces Aosta, Bolzano. ,hc 

were excluded for lack of homogeneous data Indices relati „ 

1953 and 1963 elections were constructed with statistical 
1951 and 1961 census Tlie 1961 agricultural census was used 
agncultural data Indices relating to income were cons^ructe 
basis of data assembled by G Tagliacarne It was a o po ^ 
compare factors regarding economic management, in spi e o ^ 
sence of professional classification data, such as for wage 
mezzadn (sharecroppers) and farmers, in the 1961 genera 
These data had been available m the 1951 census, and tor 
substituted data relating to area under cultivation by mezzo ri ’ 
farming or the employment of wage earners j.—cll) 

parapolitical organizations, and press circulation were provide 
by the respective parties and organizations , . jp 

Since we were limited as to time and finances, we were 
calculate correlations for each of the eight national parties In ^ 
correlations were calculated only for the two parties that , p^j 
at the time of the research, the DC (Democrazia Cristiana) an 
(Partito Comunista Italiano) and the others were classified as ci 
non Communist Left or Right 


The Communist Vote 

For each of the major parties, we have prepared a table of signi 
indices of correlation It should be remembered in the apprai^ 
these indi«s that, on a basis of eighly-cight units, the correlation 
significance only from the 0 25 level upward, even if the lower mm ^ 
constitute a useful point of reference In the following chart, the fart® 
positively correlated with the Communist vote m 1963 are listed 
order of diminishing significance 

022 

Members of the party ® 

Members of CGIL (Confcdcrazione Generale Italiana del , 

Lavoro) 
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636 

530 

449 

310 

282 


Circulation of L'Uniia 
Mezzadrta (sharecropping) area 
Vote for the Republic m 1946 
Participation (percentage of voters) 

TV licenses, 1962 

The factors post.tve.y corretated wtth the Co.^n.st vote ,n 
were as foUov.s 

Members of the party 718 

Members of CGIL 629 

Circulation of L’Unitd 451 

Parttcipatioit (percentage “f 

Vote for the Republic in 1946 434 

Mazadr, (sharecroppers) „„ellled with 

The permanence and *“‘’'"li:^Jd''b“tht'’mii'ni=nanc= of posihve 
the Communist vote are „,a,iy the same vel 

that was significantly con' J„f me ,,e provincial 

Identifies a social c4‘'f;/„':,'5 „„ded dale "'“""f "‘ 0,10 use data 
Since the censuses have not R preferred i^ot m 

distribution of 'ntl''"'”' somewhat uncertai 

in industry, we hnow Communist vote 

of the population which man 

tion fMTcentag® * ih a democratic system o 

In Italy a ^ industnal country wi part due 

any other f«“t "^l,t, cal elections of 1946 

government In the first 
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to the “novelty" of voting after twenty years 

the Catholic organizalions actively encouraged e ry _ _ 

ulanzing the slogan “Vote for whom you like but vot 
centage of the electorate has continued to vote in tne 
elections, although to a diminishing degree favoring the 

This tendency has been considered to “^.'/colunist 

non-Communisl vote In other words, it is held ttatt 

voters, who are more "faithfuK” committed, and . strongly 
the polls, whereas the non-Commumsts, who son, 

committed or involved, might forego their vote But if thes pe 
were to vote, they would, in general, vote for a posi- 

This popular assumption, however, seems to be refuted J ^ 

tive correlation between the Communist vote and electora p 

U seems that in areas in which the highest percentage o ci i 
the Communist and left-wing vote in general is increased 
corresponds, as we shall see, to a negative correlation between 
participation and the Christian Democrat and right-wing yo 
Another popular assumption that must be modified on e 
our positive correlations relates to the spread of television j, 

monly believed that exclusion of the PCI from the control o 
because it is an opposition party has had a negative clectora _ 
the PCI However, we find a positive correlation between e ^ 
and the Communist vote which corresponds, as we shall see,^ 
negative correlation between the exposure to television 
Christian Democrat vote It seems that the Communist Party, 
secular traditions, is better adapted to mass culture and televisi 
pnncipal medium, than the Catholic-inspired movement vyever 

This conclusion must be accepted with great caution, o 
Although television is a particularly powerful means of breaking 
cultural barriers and diffusing new behavioral and political 
there may be other elements that are affecting the correlation and s 
be considered j 

Television was first introduced into the economically more adva 
sections of the country, sections that have been traditionally Soci 
The working classes of these areas, which had achieved a highef 
dard of living, accepted and adapted quickly to new mass-commun 
tion media, including television K-iueen 

In spite of this reservation, however, the positive correlation be 
television and the Communist vote, and the negative 
tween television and the Christian Democrat vote are too constan 
general not to be significant, as we shall see when we examm® 
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correlation nosiliveiy correlated with the 

To complete Ihts summary “f^rfaclors .mmed.ately below 
Communist vote, we should indi 1 „„„„ 

the level of sismricance, about th^O 11^^^^^ ,„|lucnced by both 

The Communist ^„rts of the working class to 

economic development and the activ 

improve their economic “"J ,o a significant degree 

We may now consider the ft Xhe situation m 1963 

negatively correlated with the Communist 

was as follows 33O 

307 
305 
296 


492 

410 

334 

290 


StrsT^MConfederaaione, Uliana 

M:ir"I.mn ~. party 
Area of peasant small holdings 
In 1953 the situation was as follows 
Farmers working own land 

E’^rAheChrutian^^^^ 

Members of ACU (A«ne OU ^ 

The factors listed here are of ^ eatego^ ^related 

relate to orgamiation and PP'> „„,s, vote is "egahvely 

According to our nhar'. „a,or nn'“sn"^ also 
with the degree »f ,„d Chnsl.an “d by the 

movement (the CISU, • ^ category Spoiled 

negatively -telatc^^'^^ “ p-S ^ 

The educational level Democrat 1963) 

with >lW«ney, wbe«s Ih value was ^ primaD =<>““• 

in this respect (this ap^ j^oratc has a h'S^er « seem that 

The Christian orate Theoretically. 

lion than the Communist elec 
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.he elector «,.h a pnntary educafon woald have 

voting procedure, sufficient to avoid the annulment . 

Christian Democrat vote, however, is positively ^ . Antes 

votes According to this information, it seem^s that 

are the votes of moderate electors, from the Chns la « 

to the right wing, whose parties have not taught them to 

same care as the PCI teaches its voters However this supposition 
needs further substantiation in the perspective of the entire p 
and sociocultural context „ . 

Immediately below the level of significance m 1953 ''e 
variable of women working in industry {— 224) This in ica 
women generally do not vote for the PCI even when they 
industrial environment In 1963 this variable was somew . 
(- 136), ten years later, there were slightly more women who 
for the PCI 


The LefhWmg Non'Communist Vote 
In 1963 the significant factors that influenced the 
Communist vote (Parlito Socialisla Itahano, Partito Sociahsta ^ 
cratico Itahano, and the Partito Rcpubblicano Italiano) vicrc 
follows 


Pnmary education 

Vote for the Republic 

Industrial population 

Income 

Participation 

Strike hours 

Blank votes 

Radio-TV licenses 1962 

Secondary education 

Women working in industry 

Circulation of L'Unita 

Population working in commerce 

Members of COIL 


6(W 

553 

551 

543 

473 

440 

437 

430 

415 

392 

306 

300 

262 


In 1953 the situation was as follows 

Votes for the Republic 
Blank votes 
Pnmaiy education 
Members of CGIL 


743 

606 

596 

590 
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541 

.514 

457 

453 

388 

366 


Income 

Circulation of L'Unii^ 

Participation 
Members of PCI 
Population working m industry 
Secondary education 

These correlations relate 

tevel, orthe percentage of. 

and secondary education, (2) tit and commerce, and (3) 

percentage of population 7^'-'"/” ri'Sn. economic grorrth and 

Lome has been positively in- 

income, the left-wing no modern industrial society 
nuenced by the development of mc^ a develop- 

Although the non '^”1 ,t shares with the Communist 

,ng industrial society ‘■'f" * “ J^ta relating to pamcipahon and 
electorate, but to a greater d g . 

the influence of television ,af,.„,„g vole and blana 

There is also on high ^L, ,he blank vole, as opposed to 

voles It .pay be suggested, 7," ^ct, not from igao^cc « 

Zrvo1errefuUfU.he.g^^^^^^^^^^^ 

Totmg^^rrrnceL^L^^ 

As sve have pointed out, 

LLLr:rer:.eHven.^^^^^^^^^^ 

roLLXrs\LLsn--^--^^^^^^ on the indusma. 

by which workers can 

community. . -^e comments o y,jj 5 snU 

now remains PPP-CPP-'^'Lcugh both member- 

izational factors ,|^a PCI otgamral'°" . ® j two thirds of 

sigmf.ean.ly mfluen^ hy of 
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correlations with the CGIL 


membership and with 
be easily 


degree the positive nctor can be easily 

the circulation of the Communist paper The first . f, 

explained The COIL of Italy has been the trade 

Socialist tradition, even though it has been more imp 

to the PCI than to the non Communist 

however, is more dilhcult to explain, as L f/mrd hcavi y 

non Communist left wing in the years immediately prKcding 

electoral campaign and even during the campaign itsc roiin,v,ne. 

In our opinion, the interpretation of the correlation is 'j’ . 

During these years, there was a reduction in the size ^ 

of the non Communist left-wing press La Gtmuzta 
Voce repubbheana was reduced in size, and the A\anti cc i 
democrats and socialists who had read these papers. 

lamed their attitude ofresentment or rescr\e toward the in ^ 

press, which rcficcted moderate and conservative opinions 
L’Umta which is the only national * people’s” newspaper. 
popular with the left wing non Communist sectors of the pop 
On the other hand, these sectors have attained an i -,j,g 

cultural level that permits them to evaluate critically the editon 
of the Communist press, and the reading of the newspaper i 
necessarily followed by an acceptance of its political bias 
These situations should be clarified by further research There 
need for data regarding newspapers with different onentations . 
they may be compared, as well as research other than of the 
type This need for further classihcation, however, in no way m'a * 
our ascertainment of the basic characlenstics of the left-wing ^ 


Communist electorate In terms of economic activity, income 


level. 


and educational level, it is comprised of the upper stratum of the emerg 
ing modem industrial society 


The Christian Democrat Vote 

The DC voting patterns further clanfy the general electoral 
The significant factors that positively influenced the DC vote m 
were as follows 


Members of the DC 
Spoiled votes 
Members of CISL 
Members of ACLI 


496 

359 

J12 

293 
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TTie situation m 1953 uas as rollout 

Fanners working own land 
Primary education 
Members of ACLI 
Members of DC 
Women in industry 
Population working m industry 
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448 

368 

359 

324 

275 

253 


As in the case of the PCI there is a prevalence of organizational data 
Tlie predominance of these data is compared to social factors makes 
the DC electorate the direct opposite of the PCI 
Tlierc arc also some significint changes in the electorate of the two 
years The positive coireblion with the population and women working 
in industry which was present in J953 had reached almost the zero point 
ten years later The two impornnt pieces of social data of 1 953 the 
level of primary education and influence among farmers working their 
own land were i little below the level of significance (about 20) in 1963 
In this regard the dilTerence in kinds of information on which the index 
relating to direct farm management was constructed should be kept m 
mind 

On the other hand there is a negative correlation above the level 
of significance between the DC electorate and the developing industrial 
society The situation in 1963 was as follows 


Members of CGTL 
Members of PCI 
Circulation of L Uni/^ 

TV licenses 1962 
Income 

Vote for the Republic 
University degrees 
Afe-'i.adri (sharecroppers) 
Participation 
Strike hours 
Secondary education 
Population working m commerce 
Blank votes 


702 

688 

675 

534 

443 

417 

399 

367 

352 

331 

319 

283 

255 


The siloalion in 1953 was as follows 


Members of COIL 
Members of PCI 


720 

609 
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Circulation of L Vnita 33 O 

Mezzadri (sharecroppers) 279 

Illiterates 259 

Agricultural laborers 

As surprising as it may seem, the '^art ‘i^ ITO therT*^ 

was more left wing in character in 1953 than '963 53^ Republic 

no correlation at a significant level between the vote P 

and the Christian Democrat vote (the correlation is ). 
has been one of the constant factors in the DC Tl r electorate's 
ably the result of the more right-wing sectors of the Di- 
orientation toward the PNM (Monarchist Party), which between 

height of its expansion But, m fact, the negative corre a i 
the right wing vote and the vote for the Republic was hig e - ,,jg 
than m 1953 (- 688 and - 870) If the MSI (Mov.men^o ^cial^ 
Italian©) is excluded, the correlations are slightly lower I — -owreia 
— 841) Furthermore, in 1953 there was a significant 
lion between illiteracy and the DC vote, which by 1963 had p 
disappeared These two facts, however, can be reconciled -uihern 
In 1953, the DC lost votes to the right wing primarily 
rural areas These were the votes of an electorate with a low e uc 
level and a monarchical orientation In 1963 , on the other han ’ 
lost most of Its votes to the right in the northern cities These wer 
for the PLI (Partito hberalc italiano) from an electorate with a 

educational level which was not necessarily monarchical In fac , 

were negative correlations in 1963 between the DC electora 


secondary and university education 

In 1963, besides the previously mentioned correlations relating ^ 
moderate tradition, educational level, and the organizing 
PCI, there were other indices in negative correlation with the D 
These related specifically to the diffusion of television, income leve . 
population working in commerce, which are all typical 
relating to the emergence of an industrial society We have 
with the other phenomena negatively correlated with the 1963 D 
in our dicussion of their significance in relation to the left wing vo ^ 
It seems then that m 1963 the Chnstian Democrat vote, msola ^ 
It concerned a senes of correlations relating to tradition and organi 
tion, was the exact opposite of the Communist vote Furthermore, ^ 
appears to have been a vote negatively correlated with the phenom 
related to industrialization It is a vote that is characlenstically uo ^ 
Communist and linked to those sectors of society less affected b> 


phenomena 

already 11“” 
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dynamics of indas.i.al.mlion and urbani^Uon This si.ua.ion picscn.s 
significant analogies to the right tsing vote 

The Sight Bing Bn'e 

Li. d.Bmnces «c have distinguished in our 
In order to assess ...g, „,„g the liberals monarch 

™rdMSra"nrth\ro?,he.^^^^^^ 

There are some major dilfere strongest right wing 

during the period under *^'“""'°" 'J,”trong in what might from 
party was the PNM ""'f backward partsofltay 

a socioeconomic ,,,„„gest right wing party was the PU 

In 1963 on the other hand the ® ||„ advanced parts of 

Ihich was strongest in •>'; -"r„Vto have been the most 
Italy In both years however the „r 

backward sector of the ngh ® working in industry In fact 
rneoltte educational level ^ wing as a whole than for 

all the relative indices ore higher ror nc t 

the right ^Mng with the . be,v,cen the moderate electoraic 

‘ in order to clarify the «'«'2*u',J^r.rn the ""’"sUlfl 

the Cltrishan Democrats * |^^R,ght tn 1953 and w^ 

differences tn the ^“'"'"1°" °J,,hefac?o«s,gn.«cantlyccrrela ed w,^^^ 

confine ourselves to point mg t 'h Christian Democtots 

the vote of all parlies to the 

The positive correlations are 4,3 

329 

Spoiled >otes 328 

Illiteracy 310 

Area of small holdings 

University degrees ,„,t, the members of the DC 

„ can be ^^^.“XrccTcvTm j^jt'rpra'tXeTa" 

Uer IS .he 

to zero ‘‘'•''P^LlmembetshtpCdW 
correlation with ACLl m 

The negative correlations were as o 

Vote for the Republic _ 458 

Primary education _ 410 

Members of the PCI 

Participation 
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Members of CGIL _ 394 

Members of ACLl _ 374 

Mezzadn (sharecroppers) _ 322 

Circulation of L’Unita __ 319 

Blank votes _ 248 

Population working in industry 

From these correlat.ons U .s dear that the right-wing vote is prmrt 
correlated with conservative, monarchical tradition It is ^ 
positive correlation that generally appears for the DC to a sig 
degree whether at the provincial or commune leve » 
exception of the 1953 elections analyzed at the provincial level 
Furthermore the right wing vole is negatively correlate 
level of elementary education and positively correlated * \ector- 
This IS the major difference between the right-wing and the c 
ales, and it seems to be the educational level that transfers the . 

vote from the Right to the DC This is even clearer at the 
level, but the correlation concerns only about one per cen 
population, usually from the privileged high-income groups In 8® ’ 

however, with this one exception, the right-wing vote is 
correlated with income, an element that is analogous to the DC e c 
ate The positive correlation of the right-wing vote with difiusi 
peasant small holdings also indicates an analogy between t c 
electorates 

With regard to negative correlations, the most significant, as we 
seen, relate not to organizational factors, as m the DC and ® ^ 
ates, but to traditional and educational factors In this sense, the ri^^^ 
wing electorate is in diametrical opposition to the non Communis 
wmg electorate p^j 

We also find, of course, that the organizational strength of the 
ncgatncly influences the right wing vote, and other factors posi •' 
correlated with the left-wing vote, such as level of participation, num 
of blank votes, and extension of metayage area, act in the same w 
But the organizational strength of the PCI positively influences the 
vote to a much greater degree than the right-wing vote It is very 
cant that the membership of ACLI, which is the left wing of the ^ 

parties negatively influences the right wing vole almost as much as 
number of members of CGIL, and to a greater degree than the circu 
lion of L'Uniia 

As we have pointed out, there are several sectors of convergen^ 
between the Christian Democrat and the right-wing electorates, 3^ 
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here arealso two furidemental factors of difrerenljation cducalionaad 
income The DC electorate is belter educated than the rieliMiinc 
electorate, but it IS alsopoorer Butinlhisregardilisinlcreslinslotiole 
that the situation in 1963 was the reverse of the one in 1953 The 
correlations between income and vote were as follows 


i9Si m3 

Right wmg - 453 _ 085 

DC _ OSO - 443 

We believe that this situation is related to the modifications that 
occurred tn the strongest right wmg party between the two elections 
As we have said, the right wmg of 1953 led by the PNM was pre 
dominantly a poor, subproletanan and monarchical electorate On 
the other hand, the right wing of 1963 ledbythePLf was middlcsclass 
and not necessarily monarchical In both of these instances (he Right 
look votes away from the DC in (he corresponding sectors of the 
electorate 

It should also be noted that (here is a nonsignificant correlation but 
onlyslighlIybelowthetevelofsignificance(~ 184) with the number of 
strike hours As we have seen it was at a significant level with regard 
to the DC ( ~ 331) vvhich indicates a less conservative orientation 


Vie Circuits of Carrelafm 

Although the tables of the previous sections indicate the relative 
influence of different variables they do not clarify the relationships 
between these variables For ibis purpose we have constructed causal 
linear circuits based on the coeflioenls of correlation 

For the methodological significance of the causal linear circuits we 
have used the suggcsiions set forth by Raymond Boudon in recent 
contributions * With this type of analysis we can generally determine 
whether variables have a direct or indirect influence and the causal 
relations by which these influences arc determined In other words 
given a dependent variable which in this case is the vote we can 
identify the variables that influence it and determine whether these 
influences are direct or indirect 

For example, if we examine the circuits of Figure I we can see that 
income does not directly influence the vote It is mediated by the 
variable of education, which m turn is sometimes mediated by the 
vanable of tradition and organizaDon 
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roreacHpamcaro„pu„..— 

'/'’:n73The flVsl w'lsor^ general character nherear .he seennd ,s 

more limited, relating “ ^“'j“;^elnpinen. in a couniry ir the 
A well known index of '“"”™ , . almost doubled 

average income of its populati is 

be.w«n 1950 and 1963 F™” Colnnis. kef. the PCI 

positively correlated with a vo negatively correlated with the 

and the nonexlremist Right we „ may be seen that 

DC 110°"'“"""“““"° ?!„Tlihe vole Immediately connected 
income exerts an indirect in of education and successive 

with income in Ihe circuit *i“" ^ ' ^,„o„ ,,„d.ca.=d by a vote for 

links arc represented by such factom as of strike 

r ml^^ nhrr « .nouence circuits are ,n 

“ cLeful examination of 

negatively influence education mediation it 

aSSaSxsSi 

and membership m he m* through membership 

vote participation also 

party (See F'^' ,he PCI is ptacucajjy t opened of 

thJTmterTat “e^^m^for.:.” a t^f-'^^thT.n' mm 

influences PCI membership) „„ always correlated 

“f F- and strike hours are 

These two variables i 
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opinion that television only reduces social 
which people inforra themselves about social reality it 
by which these tensions may be intensified It serves a 
which consequently favors the left more than the righ 

With regard to the Christian Democrat circuit. “ ^ 

pnmary education negatively influences DC neeatfveiy 

positively influences the vote Illiteracy, on the 
influences the DC vote, but it may have a positive ,y 

mediated by party membership Secondary education ‘ ® ^ 

correlated with both the Christian Democrat vote and party 
It IS worth noting, however, that, m 1963, although „hpfshiD 

did not favor DC membership, it did favor CISC and ACL me 
and that there is a positive correlation between . u.p 

Although neither CISL nor ACLl membership favors DC 
and there is a negative correlation between ACLl and DC mem 
they both lead directly to voting for the DC, bypassing party 

It seems that party membership is a filter for the votes 
Membership in the CISL and the ACLl, on the other hand, 
filter for persons with pnmary education In a similar wa> par 
in the various forms of the Catholic organizations appears to co 
m favor of the DC vote There are other factors in the circuit, how * 
which act against the DC vote The left wing tradition, secon 
education, television, and strike hours, both directly and throug 
parly, function m this way 

The causal circuits of the non Communist left wing and t e 
Wing form an antithetical pair The circuit that leads to the le l 
vote — pnmary and secondary education, republican tradition, 
participation, with the addition, m 1963, of the spread of television 
strike hours — is the antithesis of that leading to the right wing v 
all these factors act negatively, whereas illiteracy is positively corre 
with the vote 

As we can observe in the diagrams, there was no significant 
between 1953 and 1963 for the Ufi, the PCI. or the DC But as 
already indicated, there was for the Right The situation 
regard to only the 1963 circuit relating to the right wingwithout the 
This IS fundamentally the libera! Right (7 per cent of national vo 
as against 1 7 per cent of the Partito dcmocratico italiano di . 
monarchica) In this last circuit, the left wing tradition strike hours a 
participation continued to influence the right-wing vote negative 
Secondary education, however, cither directly or filtered by tclcvis>o . 
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positively moderate right viing of 

a significant degree ( 32 ) j .^ban, more closet J 

"Mcanbesee„,th=secireu.tsd27ptSS 

represented by the percentage j found. howe\cr, 

commerce as compared aith that oftneome 

srr-s:: 

"sr,;. - -• rr-- si 

minor integrative circuits ' . , f„, |053 the circuits could be 

relevant census data was available on , „,c factors that lead to 

constructed only Tor that y«' vote the other shows 

the Communist and the non Comrn“^„ocrat and right wing vote 

the factors that lead to the P"'" ,ase the final hnl is represented by 

with or without the MSI mere*/ in the second the 

the prevalence of “'^^^Sene. of farmers cultivating their 

final link is represented by P 

Fascist Right structure of society _j,ug for its 

Although the favoring the PCI “““"rpohtieo- 

tcfl wing, there are oft ^ f^^^^ important “f “ ,„dc union press, 
position as second institutions „ jc„cy to vole for 

social organization considered as a natu ,n (gc 

etc , direct what migh ^ „ .hould between the 

the Left toward a appear to be any ,bal there 

frinret;rnd.heCom^^^^^^^^^ 

=rr;:f=2siK=;::-‘”-" 

party workers, who are nonp 
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be the channel through which general .nrormat.on and Communist 
propaganda reach ihc illiterate section of the 

With regard to the right wing or moderate electora , 
lutionary factor that clearly favors the majority Jf 
mentary education but even this factor favors e „p]ained 

Left more than it favors the DC This characteristic be exp!^ 
by the Catholic bias of elementary schooling in Italy ^ ^ 

effect of education fa\ors the Left, but this tendency i 
redirected by the religious orientation of Italian education 
As with the PC! vote, the Christian Democrat vote 
on organizational factors As in the former case, the P^*^ ^rnimulate 

whichtheilliteratevote iscoUected.and thcClSLand AC a 

the vote of those with primary education Also its an 

organization of farmers working their own land explains t c 
Democrat prevalence m this sector m spile of its circuit 1 cn i 

the Right factors, 

The relationship between the Right and certain evolutionary 
such as secondary education, spread of television, and mco ’ 
already been indicated In 1963 these factors tended to favor a 
and conservative Right, as represented by the PCI, and not 
element, such as the PNM in 1953 and primarily the MSI in bo 
and 1963 


THE CAUSAL CIRCUITS AT THE COMMUNE LEVEL 

The causal circuits at the commune level were constructed on 
basis of the same criteria as the analogous circuits at the 
level With these circuits, however, a greater number of units 
considered On the one hand, this rendered all the correlations 
cally significant and, on the other, permitted comparisons 
at the national level with the six historically important areas 
zones have been characterized m terms of the predominance in cac 
the Communist and Christian Democrat Party subcultures 

Zone / (Northwest Italy) includes Piedmont, Liguria, and ^ 
bardy, excluding the provinces of Bergamo, Brescia Mantua, 
Piacenza In the late nineteenth century socialism was intro u 
into this zone . 

Zone H (Northeast Italy) includes the provinces of 
Brescia m Lombardy, the province of Trento, the province of ^ 
all of Vencto with the exception of the province of Rovigo 
zone outside the industrial concentration of the Turin Genoa*N > 
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mangle, where clencal mfluence .s conarderable In other word. « a 

“white” zone “ , .n, nmimccs of Mantua Rongo, 

Zone III (Central Italy) mclndes th p „(• p,acenza, 

Viterbo, the whole of fh, p„v,ncc of Aseoli 

Tuscany, Umbria, and the Mare P anarchist, and socialist 

Pieeno This is the zone in which hto ^„,„y These 

movements occurred |„s,vc|v acncoltural socicts and 

movements took place vvi hin “ .“S- aone 
were strongly anticlerical It is then ^ ascoIi P.ceno, 

Zone ly (Southern Italy) ^e of Viterbo, Campania. 

Lazio, with the exception of the p ,,as no 

Abruzzi e Molise Apu ha, movement before the 

widespread cultural political „„ catholic Such move- 

Fascist period in this ^“which semifeudal conservalism 

meats are found in isolated areas in 

has been maintained yj j- jardinia Although these 

zone Zconsists of each has clearly defined 

regions share a "’“'f' , characteristics 

and individualistic histor available at the commune 

As the data regarding income ^ hy the percentage of the 

.hlmreut^utveconu^^f-o-^^^^^^^ 

the two jb- noted that id 1 953 tbc\ an ,i],tcracy 

their own land, It . degree positively parties 

the former was to a ^ for the PCI and th g agncuUural 

(482) and therefore with the 'Lnt correlation 

1963, on the f",5ttorets showed no s,gnitonj^_^ 

,919 elections ”„ed . hum P'»P“"“’“ 

nOTnlly»nneveJ«”>lco”“ 
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correlated w,th secondary edncafon It ‘“4 

Christian Democrat vote and, to a lesser degree, the right wi g 
The causal circuits for all of Italy are shown m H-B'”'" ^ 

The similarity of these circuits to those at the provinci ^ 

the patterns already presented With level of industri u^H^een 

starting point in the causal chain, the antithetical relati P 
the PSl and the right wing is confirmed In the first >"5“" ’ 
centage of persons working in industry has a positive in u ’ 

second, the same factor has a negative influence Ananalogo 

exists for the DC and the PCI Although mdustnaliMtion 
indirect influence on both, upon examination of the me la mg . 
in this influence, we find that they are antithetical as wel a 
favors the DC, particularly primary education, negative y a 
PCI On the other hand, those factors which support _ 

vote — secondary education, though this is of little weight, * * ^ 
progressive tradition, number of voters, spread of television, an 
marginal factor of mezzadri — all have a negative effect on 
vole In effect, the mediation of primary education is necessary m 
fonndustnalizationtoinfluencelheChnstian Democrat vote posi > 
whereas the mediation of secondary education and of the sprea 
television is necessary in order for the Communist vote to be 
There are several factors that are always positive for the PS 
from the aforementioned direct influence of industrialization 
primary and secondary education exert an indirect influence 
educational factors were both negative for the right wing in 19 » 
in 1963 the former was positive and the latter negative, and the posi > 

evidently a 


influence, in this case, was also direct This variation was 


consequence of the already desenbed liberal leadership of the right wi o 
m 1963, which reduced the positive influence of illiteracy on the 
wing vote Conversely, the negative influence of illiteracy on bot 
Christian Democrat and Socialist vole was reduced, whereas its posi i 


influence on the Communist vote increased . 

In the commune level circuits, the factor of tradition, indicated y 
a vote for the Republic in the referendum of 1946, remained decisi' 
With regard to the DC, this factor maintained its position 
industrialization and level of education in the causal chain but 
in significance This v-as also the case with the Right, in which tn 
electorate is more ‘ cultured ’ but nonetheless conservative 
The circuit of the PCI remained practically unchanged The 
sivc tradition served as a filter bclvwen the vote and elementary e ^ 
tion both in 1953 and 1963, but tn the former year it had less influence 
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compare it with the six areas 


The Industrial Triangle {Zone /) 

The mdustnallrianglc V ineA AiEJitoMly THie industry 

as can be seen from the <a™'« „„ the Socialist vote 

of the area exerts a greater ,„creased betvteen 1953 and 

than ts generally the case and hs mil eorrelation 

1963 Correspondingly '''"C » a B although this index 

betneen industrialixation and the right 

decreased between 1953 and 1963 aon, posit, on 

may be supposed th „hich wc were u 

rrn;";n.nipo-n.h^^^^^^^^^^ 

t'prcse- “V" ^'HTri^nr'owcver appeal - a Pon..« 

Secondary "Jp in 1963 asaresult orju 

inlluenceinthe leftwinE^^ But “ ^"^Ljponship between 

,t appears Democrat vote be clarified by 

fluence on the education could prob y ^^^ 

the right wing and seco “’,1 'erven cultural level the 

data which we have nri^^^^j,,^, between the g In that the 

party press It cou ^gj,t wing e ® , ^nai situation favors 

conservative tradilm son " ^,gbt also confirm the 
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• moderating” role played by the nonparty redueauon 

InthecaseoftheChnstian Tn this area 

remains the central link of the chain leading to the 
It IS primary level of education, and not illiteracy o tradition 

which IS positively correlated with the modcrat P l„iEai 

This double component, primary education and m J^ustnal 

tradition, is the basis of the Christian Democrat vote 

Party organization, the nonparty press, and °'^^*"'?jEne 

are also important factors of the DC vote in the 

Television IS characteristically a negative correlation Wit t c 


The White Zone (Zone 11) 

The correlation circuits for this zone also show the 

from the national situation, although less important than o 
industrialized area They may be seen in Figures 12, 13, 1 
In this zone the circuit vanablcs that always favor the 
gressive tradition and percentage of voters — i^amtain their ^ 
with smaller indices In other words, there are some 
this causal chain whereby it can be more easily broken Even m 
ization does not exert such a strong direct influence on 
specifically the PSl 

In the case of the DC vote, however, the central link in t e 
primary education, is particularly strong in this zone But it vanes 
the national situation because of its connection with political 
Furthermore, unlike the national scale, it is negatively correlate 
illiteracy This charactenstic was also found in the 
m this case it was also negatively correlated with elementary edu 
The only positive correlation of the progressive political iradi lo 
with secondary education _ g 

In this area with its pronounced Catholic hegemony, the 
political tradition is filtered by both the absence of education 
primary level of education It is only at the level of secondary ' 
that culture does not act as a filler for this tradition In fact, this e _ 
tional level serves a role m the left-wing, particularly the PSI. 
Television also plays this role, as it has a positive correlation wiin 
progressive tradition in the left wing circuit 

11 should also be noted that in 1963, secondary education and ra 
television had the same position in the circuit leading to the right ^ ^ 
vote as they had m the circuit leading to the left-wing vote As n 
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a.eady been expired 

middle class character of ^ moderate character 

.rad, non ,s Jh ,e "one ,s du„e clear 

of the Christian Democrat electorate in me 

from the causal circuit 


„,e«edZone(Zo* //fl 

The circuits for Zone remembered that the PCI has a 

appraising these circuits it should W ^ 

particularly strong organization m ^ integration 

political participation , actor nluch could not be 

The presence of this '"■f^^ ^Vey to the interpretation of all 
quantified at the contmune level S t all the 

ftectrcuits It P™"'*" of elementary education Thi 

other factors, n.th the f ^ 

ts the only 'let"'"; .X lrrelatednith the Christian Democrat 

vote and, conversely, positisely „„,d, redly 

scale in favor “ agmlicance pj., organization 

indices and are “f ||rer represented by the 1 c g 

At this point the 1 nh « the pouW va 

can be clearly un ' draws into the , j forces such 

tradition and P‘‘'['‘?an indilferent '"11“'"“ Radio-television 

that are in Ihemscl ^aryeducation and already negau'c'y 

as.ndustnahzat.on,sccond^^^^^j "pLhe^more the 

also fits Chnstiao variable confirms this 

correlated with the professional other left ^Mng party 

correlation of orrelatcdncgamcly w' political and para 

situation, It IS e this is a ® 5,jre Federation of 

the PS! such as the hc 

political organiza 
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Figure 18 

Causal Models of CitRisnAN Democratic Vote 
ASD TitE Rightist Vote is 1953, in the "Red” Zone 
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Figure 20 

Cawal Models or Communist and Socialist 
Vote is 1953 is the “South’* Zone 
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Figure 22 

Causal Models oe CimisnAs Democratic Vote 
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Since 1946 Most of them, however, were before 1950, which was about 
the time the industrialization of the area began 

We find in this area that industrialization has a more direct influence 
on the vote and that it influences the PCI positively and the DC and 
Right negatively On the other hand, this variable has practically no 
influence on the PSl, it is not favored by participation, which is usually 
the case with the Left Other factors that are usually connected with the 
Socialist vote, such as primary and secondary education, also have a 
minimum weight in this circuit Progressive political tradition remains 
influential but decreasmgly so Although the paths that lead to the 
Communist vote are quite clearly indicated by the circuit, the Socialist 
vote seems to be somewhat more fluid and difficult to classify This gives 
us the impression that whereas the Communist vote is relatively 
consolidated, the Socialist vote is still in the process of consolidation 
With regard to the DC, elementary education remains the central Jink 
in the chain leading to the vole, and it is only through this factor that 
the negative influence of illiteracy, secondary education, and radio- 
television IS mediated and reduced The other link in the chain that 
reduces the negative influence of industrialization and thereby favors 
the Christian Democrat vote is the conservative political tradition 
The circuit of the Right is analogous to that of the DC, with the 
single exception that the role of elementary education m favor of the 
DC vote is assumed by secondary education in the case of the Right 

Sici!) {Zone K) 

The situation is different in Sicily Industrialization has had little direct 
influence on the vote, and this influence decreased between 1953, when it 
affected the PCI positively and the DC negatively, and 1963 Cor- 
respondingly, education has become a more important link in the chain, 
and It serves as a filler, as already observed on the national scale, for a 
variety of other factors illiteracy for the Left, elementary education for 
the DC, and secondary education for the Right This circuit, however, 
differs from the national circuit because there is a strong correlation 
between illiteracy and the lefl-wjng tradition ( 224 in 1953 and 326 m 
1963) As a result, illiteracy more frequently leads to a vote for the Left 
than was the case m the other areas of the southern peninsula and 
islands Conversely, secondary education clearly favors the right-wing 
vote, particularly in 1963, and exerts no positive influence on that of the 
left wing, even on the Socialist vote Not even the mezzadna serves as a 
link in the chain 

We also find that the Socialist vote differs the most in Sicily from 
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(he national situation aUo wiib regard to factors other than level of 
education In 1953 the \otcxvisposit.vcI> rtlalcd not on!> to the number 
cl toters winch is not uncommon hut also to the factor of farmers 
farming thc/r own land which indicates the moderate character tint 
socialism IS assuming In 1963 on the other hand the participation 
factor no longer entered mio the circuit leading to the Socialist vote 
I^n (his respect the f»SI was similar to the Right although the two could 
be diffcrenttated by means of other fictors such as political tradition 
the persistence of illiteracy and less extensive secondary education 
These factors accentuate iht populir connotations of the Soaahst 
electorate and perform a role contrary to that of the participation 
factor 


Sanfmia (/one I'/) 

fn this area we find that industrialization once again has a direct m» 
flucnce on the vote even though it decreased between 1953 and 1963 
This influence favored (he Left and worlcd to the disadvantage of (he 
DC and Right The only din’crcncefroniiheSouth lies in its influence on 
the Left and the Socialist vote AH other factors operated in their usual 
way Ihcmcj.flrfr/asupportcdthcCommimisi Left andfarmersworkmg 
their own land favored (he DC Elementary education remained (he 
decisive positive link for the Christian Democrat vote and a negative 
factor for the Communist vole Secondary education m a circuit in 
eluding radio television and participation remained the connecting link 
between industrnliration and the fight wmgvote Radio television also 
functioned positively m (he circuit that Jed to the Communist vote 
With regird to the PSi the positive role of progressive tradition and 
primary education in 1963 remained unchanged As in Sicily partici 
pation did not enter into the causal chain that led to the Socialist vote 
nor was television a decisive factor The consequences of this break in 
the circuit were reduced by both the positive direct influence of 
industrialization on the Socialist vote and the indirect influence of 
elementary education and ihe progressive tradition 


T/ie National Circuit and the Uea Circuits 
Now (hat we have examined the siiuatjons for the individual areas 
we can compare them with the situation on the national scale 
In all of Italy political tradition emerges as a decisive factor With 
the single exception of (he PSI m the red zone in which there was a 
Communist hegemony ,i serves as a link in the causal chain fnthiscasc 
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It was possible to collect most of the progressive political tradition under 
the PCI banner, all other political forces tended to be moderate With 
this exception, which is ncsertheless a significant one, political tradition, 
as It IS transmitted through the famil), leaders of opinion, and political 
or nonpohtical associations enters into the causal circuit that leads to a 
left-wing, either Socialist or Communist, vote 
The role of elementary education is also clear at both the national 
and commune level At the commune level primaiy education emerges 
as the decisive link in the chain leading to the Christian Democrat vote 
Although at the national level (which is based on provincial data, 
thereby attenuating local, regional, and areal differences) primary 
education appears to be a factor more favorable to the non Communist 
Left, there can be no doubt that at both the national and areal level this 
factor IS a link in the causa! chain that leads to the Christian Democrat 
vote Thus we can deduce (hat it is this level of education, particularly 
m the smaller communes which favors a Chnstian Democratic electoral 
orientation 

As one would expect, illiteracy acts m the opposite wa>, and it never 
enters the causal chain leading to the Christian Democrat vote It does, 
however, have a positive effect with regard to the Communist and the 
nghl-wing votes and, in Sicily, to the Socialist v ote Other characteristics 
in the chain however, vary from area to area 
The position of illiteracy is particularly important m the causal chain 
leading to the Communist vole in the industnal zone, Sicily, and 
Sardinia South of the peninsula, however, industrialization is of greater 
weight, in the white zone, the important link is the level of secondary 
education, and in the red zone, as may be expected, it is the organization 
factor Even for the Right the role of illiteracy vanes from area to area, 
depending upon the strength of the PCI organization Its weight is less 
in the industrial area, Sicily, Sardinia and also in the white zone, in 
which, as has been seen, it may tend toward the Chnstian Democrat 
vote 

With regard to the DC. secondary education plays an even more 
negative role than illiteracy, on the same level as radio-television 
Whereas illiteracy can be a filter m the white zone for even the DC vote, 
secondary education and radio-television are never links in the chain 
leading to the DC vote 

Secondary education and radio television generally perform a similar 
role but their influence on the left-wing or right wing vole vanes from 
area to area In the Veneto and the Islands these factors favor the right, 
in the industnal area they support the PSI Invariably they enter the 
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causaJ chain leading to the Communist 
Iitile influence 


Tote, but the> have relaflvcly 


From our analysis of Ihc corrclalions in ihc circuit, »c can malt 
certain conclusions about political forces The greatest ililfercncet from 
area to area seem to be icthm the PSI sshereas the DC oppeats to be 
the most coustanl The Right and the PCI are also relaliielv eonstanl, 
aUhough vsith certain areal diireecnees, primarily in the "ads meed ’ 
character of the tight-simg elecioralc in the uliile zone and the more 
“tttdastnal" character of the Communist electorate in Ihc south of the 
peninsula 


As we have said, ii »s the Socialist electorate that appears to be the 
most \aricd In the first two ureas the indusinai and white ^ones. it is 
typically industrial, cultured, and progrcssuc. in Sardinia it ts cultured, 
but in the red zone, it is moderate In the south of the peninsula and m 
Sicily It « d/ff/cult lo claKif). ss there are hnHs rhat are not always 
analogous in the c.iusal chain leading to the Socij1i>i vote 

One important element that plays an ambivalent role, however, is 
level of education U docs notact asa filter toward socialism in the south 
of the peninsula, and it fa\ors the Right ai the level of secondary 
education in Sicily 

With the exception of the orgamiation factor, which we were not 
able to examine, it is our general conclusion that the process of induv- 
Irializatiort influences the vole almost exclusively through the filter of 
the educational level The progression from illiteracy to elementary 
education tends to favor the DC to the disadvantage of the PCI and 
the Right There is also, however, (he filtering function performed b) 
political tradition, either moderate or progressive, which lerdt to 
Counterbalance the level of education At a further stage, the progression 
from elementary education lo secondary education, combined with the 
spread of mass culture, pnmarily through radio and television fends lo 
favor the Left, primarily its Socialist component, and the liberal com- 
ponent of the Right, to ihe disadvantage of (he OC In this cave as well, 
however, there will presumably bea limii loihecontratiion of ihe DC 
and the expansion of Ihe others through the filiering function pefformed 
by political tradition 

On the whole, the basic causes thac sinentatc the Italian flfctoraie 
seem to balance and compensate for one another This may be seen in 
terms of both space (m areaJ differences) and time (the prewms of 
industrialization and the mcrcave of culturcl Thi* halanong erect 
favors an increasingly stable frahan electorate 
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which utilizes, on the one hand, a nominal vanable, the region, and 
holds constant, on the other hand, different t>'pes of social structure 


CONSTRUCTION OF THE FACTORIAL PLAN 

By means of co\anance analysts the “disturbing” effect of a vanable 
can be eliminated It is thus a sophisticated technique of investigation 
Nevertheless, it has rarely been employed in electoral soaology be 
cause the quantified data are not always appropriate to the construction 
of an orthogonalized plan, and the statistical distributions are often 
“abnormal We have, however, a sufficient number of cases (2,477 
cantons) to surmount the difficulties 

First, we have retained as independent variables the proportion of 
workers in industry and the proportion of electors belonging to the 
middle classes After a cross tabulation between the two variables, 
we have drawn for each of them two cut points, which permit us to 
distinguish a total of nine categories of cantons These cut points have 
been determined in such a way that the number of units is “maximized” 
for the less numerous categories From this we have obtained the 
distribution shown in Table 1 

From each category 100 units were randomly sampled, and for each 


Table I 

DisTRiBunoN Obtaived in Q)variance Analysis of 
French Euctoral Data for 1956* 


Proportion of Hectors 
from Middle C/ajj« 

Proportion of Industrial Workers 

Less than 

Betv,een 

15 J and 22 4yi 

More than 
22J% 

Less than M 9% 

684 



Between 21 and 24 4% 

180 



More than 24 5% 

106 

230 

725 


cF™7nr.l,T U or the 3 000 cantons in France 2,477 were 

or moro Uou. 20^ «»3' ‘I" "»■ “"U'" 


fcr K^nance a’uTjw to apply certain cooditiont most be salisfied 
Sotiahsi, J ^ Frantatir * 
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proportion of workers in industry Hnce'h P P^^^,^ pf 

inhabitants constitute, m the * . ,4 categories of canlons and 

urbaniaation We could f reby """'P > ™ .^fs »e can espect that 
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the classification presented in Table ^ pp„,„„s, less 
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furnished a complete analysis of the variance of the religious factor and 
the political vote, and the covanancc between the two 


THE "RE-EMERGENCE" OF SOCIAL CLASS BY 
COVARIANCE ANALYSIS 

By the Fttsi (the relation between the variance of each variable and 
the residual variance), the different covariance tables indicate the actual 
effect of each independent vanabic, observed after the elimination of 
the religious factor 

It has been confirmed, for rural France in its entirety, that the pro- 
portion of electors belonging to the middle classes has no tangible 
effect on the vote Thus, as the effects of religion have been neutralized, 
the “middle class” variable does not appear to have any more weight 
than in the regression analysis This is, without doubt, due in part to the 
imprecise character of this variable, referring as it does to a socially 
heterogeneous population all the social categories with the exception 
of the industrial workers and persons occupied in agriculture 
On the other hand, covariance analysis indicates for the working 
class a degree of influence which was not fully recognized by multiple 
regression analysis All of the Snedecor F are significant, and they are 
higher for the Communist vote than for the group of leftist parties 
Covanancc analysis does not contribute anything really new on this 
point, but it assigns a greater significance to such phenomena than does 
regression analysis 

By regression analysis we have obtained a correlation coefficient of 28 
between the percentage of industrial workers and the proportion of 
Communist votes in the legislative elections of 1956, a coefficient of 21 
between workers and Communist Socialist votes, and a coefficient of 
13 between workers and votes for the Uft (Communist. Socialist, and 
Radical) For the covanancc analysis the F is respectively 19 6, 9 79, 
and 3 90 k ^ 

Covariance analysis has permitted not only a clearer perception of 
the role of the working class m the determination of the vote but also 
the measurement of the difTerentiated influence of this variable on the 
Communist, Socialist, and Radical votes 

The effect of the interaction of the two variables (workers and middle 
classes) cannot be established at the level of rural France as a whole 
If the combination of these two vanables has no influence on the vote, 
one can rightfully conclude that the variable “population occupied in 
agriculture, which is the complement of the other two, also does not 
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CKrcise any perceptible influence on ibe political altiiute al ibe level 
of aggregate data on the national scale (The itifluence of the religious 
factor has, of course, been neutnlized ) 

It IS by introducing tlie nominal variable "region that covanana* 
analysis assumes its greaicsi significance In elTcct. ‘ region ' influences 
the three dependent variables much more than the working-class 
variable This result bringv out clearly the importance of social contest 
It has been found that the influence of the workingclass proportion 
IS not only weaker than the influence of region but also, for each region, 
weaker than the weight of ibe worjeme class variable established for all 
of rural France In oiher words the proportion of workers influences 
the proportion of Communist Socialist or Radical votes less in the 
regional contexts than for all of France In the regional framework, the 
working class exercises a greater influence on the combined Communist- 
Socialist vote than on the Communist vole alone In compensit/on, the 
effect IS ni! for the total of leftist votes This last fact is easily explained 
by the great number of small farmers and electors belonging to the 
middle classes who vote Radical 

The interaction of the working-class and region variables exercises 
an influence on the Communist and the Left votes This signifies that 
in addition to the respeclivecfTecis of region and class there is a specific 
eflect that results from the interactionof iheseiwo variables 

In short if the religious factor is held constant Jhe regions tnaintam 
their distinct characters a fad which demonstrates very well the 
importance of social and political context 


CRAPlUC REPRFSENTATtON OF THE 
COVARIANCE ANAUSfS 


Covariance analysis permits ihc calculation for each of the dependent 
polilical vanablK and in eacli of Ihe cattgoncs dislinguisbcd by Ihc 

plan of analysis a new mean that one can call a corrected mean after 

the elimination (or rather the neutrahmteon) of the effect of religious 


^*^For each of the dependent political variables it is possible to represent 
graphically 

1 The eflbcls of each of Ihc I«o indcpcndcm vanabics retained in the 
factorial plan before (that is. al the leiel of variance) and alter (at the 
level of covariance) the nealrali/atton of Ihetelisioav vanable 

2 The clfecls of the loo variables «ith and vvilhoul the rcliEinov 
factor There is no doubt that the consltuelion of such eraphs is 
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meaningful only if one can establish by means of covariance analysis 
a significant effect for one of the two variables or their interaction 
Those Significant results, tested by the F of Snedecor, are reassembled 
in Table 3 

The resulting statistics confirm our hypotheses. 

1 Liberated from the effects of the religious “supervanable,* social 
class intervenes in a very positive manner for rural France as a whole 
This is just what we expected In effect, regression analysis poses a 
problem Thus, for the Communist vote we have been able to establish 
that the multiple correlation coefficient with the independent variables 
of “religious practice” and “industrial workers” was 64, whereas the 
correlation coefficient with the single religious variable was 59, and it 
fell to 25 for the ‘ w orker” variable This divergence was venfied for all 
the political variables Apparently, the preponderance of the religious 
factor concealed, at the ecological level, the effect of social class 
Although the correlation coefficients between these two vanables am 
weak, one can suppose that there is a certain interaction between them 
which produces a confusion of effects 
By holding the religious factor constant the vanations in the vote for 
the different proportions of workers are accentuated Certainly the 
differences between the results of variance analysis and those of 
covanance analysis are not considerable, but it is necessary to emphasize 
that even before the religious variable was neutralized, the vanations of 
the vote according to the proportion of workers was also weak They 


Tables 

Covariance F Test 
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I960 

9 79 
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20 (M 
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Intcniaion 

4 30 


6 76 


^ CoTimuniii Pany and SFIO ihe Soctaltsi Party in France 
Communiit Sooalist and Radical Parties 







COMMUNIST .rr COMMUNIST 

VOTES AND SOCIALIST VOTES 




COVARIANCE ANALYSIS OF FRENCH ELECTORAL DATA 


293 


ferences are significant are ncrceptibly the same for 

11 should also be noted llial ,,a,|rjrs, the 

the three political variables or 

-corrected" mean is ahvays weaUr than the^^ Communist. Corn- 
words, with “religiosity equah th P J ,1^ workers is 

munist-Socialist, or leftist '“f J “fheavily on political be- 

clearly weaker The strong ^t constitutes a dam for the 

havior: where religious pra political tendencies ate free to 
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scarcely intervenes for (he North, the region of moderate religious 
practice, and it Significanlly loweti the vote for the parties of the Left 
in the dechnstianized Center 

In other words, if the level of religious practice were the sams for all of 
F ranee, the West would vole more for the leftist parties and the Center 
less, but the two regions would remain politically distinct 

In the regional framework, social class clearly plays a lesser role if 
religion is eliminated In other vvord», it seems that in the difTercnt 
regions, there was a very clear mteraciion between the religious factor 
and the social class factor The social variable has its role, but to a large 
degree only if the reUgiom variable permits in this sense, rehgion serves 
as a kind of catalyst 


Table 4 


Social Class and Political Parties 
Covariance Analysis for Entire Rural France 

(Efeclions of I9S6) 


/ Variance of Religious Protuce 




Source of 
Vanauon 

Middle class 

Workers (industrial) 
Interaction 

Residual 

Degrees of 
Freedom 

2 

Z 

4 

891 

5iifn of 
SQuores 
13,304 
4,735 

2 053 
382,341 

Mean 

Sguares 

6.652 

2 367 

513 

429 

F 

15 5* 
551'‘ 

1 19 

Toial 

899 

402,435 

447 


2. Variance of the Communist Vote 




Source of 
fimwwr 

Middle class 

Workers (industrial) 
Interaction 

Residual 

Degrees of 
Jiwdom 

2 

2 

4 

891 

Sum of 
Sguorcs 
t.200 
1,001 

89 

107,250 

Mean 

Squares 

600 

500 

22 

120 

F 

4 93** 

4 !5‘*« 

I 

Total 

899 

J09.542 






• S;gnif$cant at 1 per 1 OOO 
•* Significant at 1 per cent 
Si^ificant ai 5 per cent 
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Tahh 4 (continued) 


3 Co\ariance between Religious Practice and the Communist Vote 


Source of 

Degrees of 

Sum of 

Mean 

F 

Variation 

Freedom 

Squares 

Squares 


Middle class 

2 

265 

132 

I 642 

Workers 

2 

2,720 

1,360 

169* 

Interaction 

4 

267 

66 

1 

Residual 

S90 

71,760 

80 


4 Variance of the Communist and Socialist Vote 



Source of 

Degrees of 

Sum of 

Mean 


Variation 

Freedom 

Squares 

Squares 


Middle class 

2 

1,658 

829 

3 37*** 

Workers (industrial) 

2 

1,048 

524 

2 13 

Interaction 

4 

115 

28 

1 

Residual 

89) 

218.818 

245 


Total 

899 

221.639 

246 



5 Co\arianee between Religious Practice and the Communist and Socialist Vote 


Source of 

Degrees of 

Sum of 

Mean 


Variation 

Freedom 

Squares 

Squares 


Middle class 

2 

215 

107 

1 

Workers (industrial) 

2 

3,478 

1,739 

9 79* 

Interaction 

4 

342 

85 

I 

Residual 

890 

159,075 

177 



6 Variance of the V ote of the Left (Communist Socialist, and Radical) 


Source of 

Degrees of 

Sum of 

Mean 


Variation 

treedom 

Squares 

Squares 


Middle class 

2 

2,811 

1,405 

4 I9*** 

Workers (industrial) 

2 

205 

102 

1 

Interaction 

4 

393 

98 


Residual 

891 

298,798 

335 


Total 

899 

302,208 

336 



7 Co rar lance bel h een Religious Practice and the Vote of the Left 
Source of Degrees of Sum of Mean 

ranation Freedom Squares Squares 

Middle class 2 192 96 

Workers (industnal) 2 1 649 824 

Interaction ^ 177 

890 187,888 211 

• Significant ai 1 per 1 000 
•• Signifkanl at 1 per cent 
SigmScant at 5 per cent 
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^ Table 5 

-M CLASS POLmCAl PARTIES AND REGIONAL CONTECT 


S»«»/ A'""' 

Fnrtiwi 


IS 

2 

4.852 

rs 

4 

1 236 

:tion 

nl 

261 

46.558 


269 

110.681 

nance 


Sum of 

tree of 

Frectlom 

Squares 

nalion 

2 

18,599 

.ions 

2 

3,164 

~ rkers 

4 

902 

-“eraction 

261 

16.985 

•sidual 

269 

39.650 


29,016 

2426 


162 66* 
13 60* 
173 


Squares 

9,299 

1.582 

225 

65 


142 89* 
2431* 

3 46** 


Co^armcebcl>^cer, J 


Source of 
yarlaiion 
Regions 
Workers 
Inleraction 
Residual 


Freedom 


Squares 

1.781 

1.005 

790 

IJ.940 


Squares 

890 

502 

197 

45 



• Significant at 1 P«r ^ 
•Significant 

• Significant at 5 per 


19 39* 
10 94* 

4 00** 
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Table 5 {continued) 


5 Cotariance between Religious Praciux and the Communist and Socialist Vote 


Source of 

Degrees of 

Sum of 

Mean 

F 

Variation 

Freedom 

Squares 

Squares • 


Regions 

2 

8,807 

4,403 

50 84* 

Workers 

2 

1,997 

998 

II 52* 

Interaction 

4 

593 

148 

I 71 

Residual 

260 

22,518 

86 



6 Variance of the Vote of the Left {Communist, Socialist, and Radical) 


Source of 

Degrees of 

Sums of 

Mean 

f 

Variation 

Freedom 

Squares 

Squares 


Regions 

2 

46,691 

23,345 

144 03* 

Workers 

2 

1,859 

929 

5 73** 

Interaction 

4 

4,273 

1,068 

6 59* 

Residual 

261 

42,302 

162 


Total 

269 

95,127 

353 



7 Co\ariance between Religious Practice and the Vole of the Left 


Source of 

Degrees of 

Sums of 

Mean 


Variation 

Freedom 

Squares 

Squares 


Regions 

2 

4,953 

2,476 

20 04* 

Workers 

2 

237 

118 

1 

Interaction 

4 

3,346 

836 

6 76* 

Residual 

260 

32,131 

123 



• Significant at 1 per 1,000 
•• Significant at 1 per c«nt 
Significant at 5 per cent 





PART rOUR 


Factor Analysis and 
Ecological Typologies 


rftjf part goes et)e step further tn techmea) rejietement onJ presenes a nrrief^ of 
txampfes of the use of fcaor ooaHjsis in the hondhttg ef multiple <laea files far 
tertltonai uniH Techniques for the rxiroetion of rommon factors from multtmriate 
forrelation matrieei were ortginaUj JerelopeJ fy Jifiereniwl psyrhologiui but 
have since been founj usful m the treatment of o<*tJc tarle^ f ether social seienee 
data. In his opening paper the 5u^yl(^ ecologist Carl Gunnor Janton sheus the 
eslraordmarjf range of ukj found for these leehmques In i/t« study of urbon strue 
tura This ts the field ithere the foetor onalytle techniques hare so fat prosed most 
useful in the mapping of dimensions of eeologital tariation Jansen Joes, it ft truf, 
pome ea a feu factor analjtie studies of lanattons aeross mJer ranges of heal 
commatsities — rural, suburban, as well as urban — bat so far most of the aork in 
this field has made use of simpler statistical procedures sack as regression analysis 
and anaij'sit of tarlancc U is rnJceJ significant that all the etamphs if tcohgicol 
foetor onoijsis assembhd in this rahme focus on sanations among areas njtiim 
metropolitan cities the arailabihi/ of esienstse files f statistics for ser/ small 
units of lerriiorj' ofers great temptations to tfic factor anal) a and opens up 
Intriguing poBjii/iiiw of theory wwirocmwi Three of the papers deal nUli one 
f the largest mtffropo/iian arcat in the uorld Greater lonjon The geographer 
kerin Cox analyzes the </(«r»6ofi(Wi of wees in ehe UmJon suburbs, and the 
sociologist Peter jVirrman and the nwfAoniififMB fxigd Houard repirt on progress 
in the Jcyelapmcnt of factor analytical typologks f census tracts m lonJan The 
fourth paper, by the American urban loeiol^ist Frank bytectser, opens up a 
comparathe perspccenc he reports on a study f the ceologicol structure oftno 
hisraneally and topographical}/ ttiy Jijcrent cities. Boston and Helsinki, and 
points to Interesting similarities and contrast* 

799 
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In the discussions oj these papers at the ^mpostum at Evsan, a number oj parti- 
cipants feh re^ unea^ about factor analysis as a method one if them ncnt so far 
as to characterize factor analysis os a *^sleJgebammer approach,” while others 
called for further tests of the interpretation of the factors extracted through the 
rotation of the matrices 

There kos consensus, honerer, that factor analysis constituted a poneful tool 
in the initmi mapping f the cotanotions in large data fles and could serve 
admirablj in the construction of typologies of ecological contexts in fact, there nas 
general agreement that principal component procedures of the type used in the 
London study ought to be inlroi^ucct/ nherever possible as a basis for the 
construction of sampling frames for '^ecological surveys ’* 

We are still far from any general consensus on optimal strategies in ecological 
research The factor analysts hate made important contributions and will, no 
doubt, go still further as the large scale computers allon more and more complex 
analyses But it is also clear that the results of factor analysis mil be caster to 
interpret if th^ are subject to farther treatment through broader batteries of 
statistical techniques Kcun Cox's study is of partievlor interest in this perspective 
hefrst proceeds by factor analysis and then checks some of his findings through the 
testing of a causal model along the lines of the Blalock Boudon schema This would 
seem to be a procedure well worth testing on a vafieiy of ecological data 
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Some Problems of 
Ecological Factor Analysis 

/ brief REVIEW OF FACTOR ANifLYSESm 
HOHAft ECOLOGY 
1 I /nfroitucUon 

Recently factor analysis has been much in demand withm human 
ecology and has almost become a controversial fad Although there are 
a few early examples, even before World War II, the bulk of ecological 
factor analyses belong to the latest ten or fifteen years 
Naturally, the short review of ecological factor analytical studies 
given here t$ intended not as a comprehensive bibliography but as a 
highly selected senes of iKustradvc examples the selection being made 
With a heavy Scandinavian American sociological and egocentric bias 
However, the review js intended lo illustrate the kinds of applications 
of factor analysis to ecological problems and the kinds of results they 
have produced The technical solutions employed and the problems 
involved will be discussed in later sections 
From the areal units the analyses are divided in two categories 
(1) those concerned with variations within communities, actually intra* 
City variations that is aspects of urban spatial structure, and (2) those 
concerned with variations among communities or other units such as 
communes, counties, and countries that is. intercity variations and 
regional structure in a broad sense Within each category studies may 
concentrate on a specific variable or aspect or be of a more general 
kind 

It should be pointed out that “factoranalysis" m this paper is taken 
to include also “component analysis’ 

301 
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1 2 Factor Analyses of Urban Spatial Structure 
121 Studies Concentrating on Specific Aspects Some factor analyses 
of spatial urban patterns concentrate on, or pay special attention to, a 
specific variable or aspect, often delinquency and crime, following an old 
sociological tradition However, also other aspects have been treated 
Back in the 1930’s Gosnell and Schmidt (1936) made a factor analysis 
of 17 voting or environmental variables m Chicago Later Holzinger 
and Harman picked 8 of the variables of this study to give one of their 
well known examples, in this case of the pnnapal factor solution, with 
“Traditional Democratic Vote** as the first of two principal factors 
(Holzinger and Harman 1941, pp 177-179, oblique solution on pp 
260f,cf Harman 1960, pp I77-I79, 287 f , 332 f ) 

A recent example of an ecological factor analytical study of voting 
within an urban community is provided by Cox (1966) He was inter- 
ested in the political dififerences between suburbs and central city He 
used 20 demographic, housing, socioeconomic, and migration vanables 
assumed to be politically relevant for 69 areal units of the London 
metropolitan area in 1951 In a preliminary stage to further analysis 
Cox extracted 4 principal components accounting for 89 2 per cent of 
the total variance of the matrix After rotation to new orthogonal 
dimensions according to the vanmax principle, the factors were named 
“Social Rank,” ‘•Suburban-Central City,’* “Commuting,” and “Age ” 
Measures of them were computed and used as vanables in a linear 
regression causal model of the Simon Blalock type (cf , e g , Blalock, 
1961, Chap 3) together with “Participation” and “Conservative Vote” 
variables, which were not included in the matrix analyzed 

In 1964 Wiseman and Warburton reported three factor analyses of 
educational and environmental variables from 1951 and 1957 in the 
Manchester area, using wards or schools as units The first analysis of 
the 1951 data in 26 wards in Manchester used 6 educational and 10 
environmental variables Four orthogonal factors were extracted and 
rotated according to the quartimax method After rotation they con- 
tributed 32 5 per cent, 19 1 per cent, 9 3 per cent, and 8 4 per cent, 
respectively, of the total variance The first three factors were interpreted 
in order, as a genetic (jicO factor of social “Disorganization,” “Eco 
nomic Background.*’ and “Lack of Parental (or Maternal) Care ” Three 
measures of intellectual and educational backwardness were heavily 
loaded on the first factor The second analysts used the same 6 educa- 
tional variables and 6 environmental ones for 48 schools in Salford 
It resulted in three rotated orthogonal factors, contributing 24, 18, and 
7 per cent, respectively, of the total variance They were interpreted as 
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but this time “Juvenile Delinquency,” which was included in both sets, 
belonged more to the socioeconomic group of variables (Janson, I960) 

In Helsinki, GrOnholm (1960) analyzed seven indices of disorganiza- 
tion and SIX other background variables with the 21 city wards as units 
By the centroid method he got three factors, which he rotated graph- 
ically to new orthogonal factors, covering 71 per cent of the total 
variance Juvenile delinquency had some loadings on each of them 
The highest one, although rather modest (0 59). was on the first factor, 
which GrOnholm interpreted as an “Urbanization” or “Metropolitan 
factor, not to be confused with the Shcvky-Dell “Urbanization” but 
here connected with shunning of work, suicide, mental disorder, non- 
residential buildings, loncitncss, and divorce The second factor was 
clearly connected with “Socioeconomic Status,” while the third one was 
ambiguous, with loadings on loneliness, poverty, and divorce 
Ecological studies of delinquency and crime usually deal with the 
residential distribution of the offenders, not with the distribution of 
places where offenses were committed Recently, however, Schmid ( 1 960) 
analyzed the distribution of crime against property, including robbery, 
and of suicide, according to places of occurrence, together with the 
residential distribution of drunkenness, disorderly conduct, vagrancy, 
and sex offenses His data refer to Seattle around 1950 He used 20 
crime variables and 18 social, economic, and demographic variables 
for 93 census tracts Eight principal factors were extracted and rotated 
orthogonally They accounted for 86 6 per cent of the total variance 
One factor was ambiguous Six others were interpreted by means oT the 
“background” variables, although two of them, “Low Family and 
Economic Status” and “Population Mobility,’ had their highest weights 
on crime variables Residential crime variables dominated the former 
factor, occurrence variables the latter one Two occurrence variables 
had substantial loadings on the “Race” factor, and there were scattered 
crime weights on “Low Family Status” and the ambiguous factor, 
whereas “Low Occupational Status” and “Low Mobility Groups” had 
small crime weights only Finally one factor with three crime weights 
had no weights on background variables and was labeled “Atypical 
Crime Pattern ” 

Boggs (1965) analyzed 17 crime occurrence and criminal offender 
rates in St Louis (1960) She discussed four orthogonal vanmax 
rotated factors which accounted for 70 per cent of the total variance 

122 General Factor Analyses More general factor analyses have also 
been published Some of them have taken the Shevky-Bell conceptual 
scheme or typology as theirpomt of departure (The Shevky-Bell scheme 
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was first publishcJ m Shevkyand William-!. 1949, for a later !crsion 
see Shcvky and Bell, J955) The scheme was supported by a cluster' 
analysis of the San Francisco Bay Rcgio/i by Tryon (1955) and by 
factor analyses of the same region and of Los Angeles by Bell (1955) 
Both analyses of the seven component variables resulted in three 
oblique factors with correlations consistent with the scheme One factor 
had considerable loadings on the three components of ‘economic 
status'* bat small ^^elghts on the other variables, the others had high 
loadings only on the family suius" components and the “ethnic 
status’* measure, respectively Thus they were easily identified as the 
three dimensions of (he scheme 


Anderson and Bean (1961) analyzed the six components, rent ex- 
cluded, of the three dimensional measures together with seven other 
^a^ab]es in Toledo, Ohio, uith 55 census tracts as units They extracted 
four centroid factors which after rotation to simple structure ucre 
interpreted according to theShevky Bell scheme with the exception that 
“urbanizatjon** was divided into two dimensions, called “Urbaniza- 


tion’* and “Family Status “the latter with a high weight on the fertility 
ratio only The factors together accounted for 82 6 per cent of the total 
variance and were kept orthogonal, as oblique rotations were no 


improvement 

In another study the six componentsof the scheme’s three dimensions 
rent again excluded, were factor-analyzed in ten cities The multiple 
group method was used for factorizing For each city the factor matrix 
was rotated to a least squares approximation of a matrix according to 


the Shevky Bell scheme In four Southern or border state cities Per 
tihty’* could be equally well or better fitted to “Social Rank ’ as 
compared to "Urbanization “ In Providence, the same was true for 
“One Family-Dwelling Umrs ** Moreover, the correlations between 
“Segregation*’ and the other oblique dimensions varied somewhat 
between cities (Van Arsdol Jr , CamiHen, and Schmid, 1958) 
Addition.!! studies have approached the urban structure m other 
frames of references, such as Schmid and Tagashira (1964) m analyzing 
Seattle Sets of 42. 21, 12 and 10 vanables viere used, a larger set 
containing all smaller ones A considerable amount of redundancy in 
the largest set was gradually reduced tn the smaller sets Various 
numbers of principal factors were extracted and rotaicd according to 
a modified vanmax criterion Eight factors of the largest set accounted 
for 88 0 per cent of the total variance, but not all of them were easily 
mternreied The first three sets yielded four clearly defined faaors 

“Family Status’ ••EconomicStatus,’”’EihnicSfatu5.“and“Maleness.“ 
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of which the latter was eliminated m the fourth set Other factors varied 
or were not easily mtcrpretablc finally, measures of the three mam 
factors were computed, mapped, and discussed 
Sweelser analyyed Helsinki and Metropolitan Boston thoroughly 
and reported the results in a senes of papers He used principal factors, 
but computed them with communahiy estimates, and varimax rotation 
to new orthogonal dimensions 

First, sets of twenty \ariablcs were prepared in Helsinki with 70 
areal units and in Metropolitan Boston with 441 units and made as 
similar as possible A second set in Helsinki had 6 variables in common 
with the basic Finnish set and 14 substitute variables The two Helsinki 
sets resulted m three dimensions that, in order of importance, were 
labeled “Socioeconomic Status “Progeniturc” or ‘ Young Familism 
and “Urbanism” in both cases The loadings were very similar for the 
sets, with coefficients of congruence (Harman. 1960, p 257) of 0 995, 
0 933, and 0 902, respectively, for the three factors Also the Boston 
matrix produced three dimensions, which were correspondingly named 
The congruence between the Helsinki basic set and the Boston set 
was substantial but clearly less than between the two Finnish matrices, 
particularly in the loadings of “Urbanism,” with coefficients of con- 
gruence of 0 744, 0 792, and 0 480, respectively, for the factors In 
Boston, “Urbanism” was associated with surplus of men and with 
housing defects but in Helsinki with surplus of women and with lack of 
housing defects (Sweetser, 1965a) 

Then, in Helsinki a set of 42 variables, containing 17 of the basic set 
variables, was compared with the basic set After the rotation of seven 
principal factors the six strongest ones were interpreted They accounted 
for 76 4 per cent of the total variance of the larger set “Socioeconomic 
Status remained the most important dimension, “Progeniture” the 
second strongest with approximately unchanged loadings “Urbanism” 
now corresponded to a “Career Women” dimension A fourth dimen- 
sion of Residentiahsm” was similar to a possible fourth dimension of 
the smaller set of variables Coefficients of congruence for the four 
factors were 0998, 0932, 0931. and 0826 Two dimensions from the 
later stages of the family cycle, “Established Familism” and “Post- 
geniture,” were added (Sweetser, 19656) 

In Boston, a 34 variable set was analyzed It was similar to the larger 
innish set but had less redundancy and included some variables con- 
cerning families, day population and land use were excluded, but 
measures of income and migration were added Again six dimensions 
were taken out, with ‘ Socioeconomic Status” as the most important 
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one. The next to strongest factor %as interpreted as “Famihsm ” There 
were also two dimensions of migration and two ethnic ones called 
Irish middle class” and “Nonwhit^Jialian (Sneetser 1965c) 
Companngthe large-set anaI>scs,Sweciser finds matching dimensions 
of sociocconomicstatus child centered famiiism and residential status 
The Boston ‘Tamilism” factor then corresponds to “Progeniii/re** and 
Established Famihsm,” the Helsinki ' Residcntialism” fjcioriotheiHo 
migration factors There are also distinctise factors ‘ Career Women” 
and ‘‘Postgeniturc’ m Helsinki and the ethnicity dimensions m Boston 
(Sweetscr, 1966) 

Factor scores of the six Helsinki factors were computed and their 
spatial distribution examined A mainly sectorial pattern was found for 
"Socioeconomic Status, * mainly zonal patterns m the others with sec 
tonal tendencies also for ‘ Progemture * and ' Residentialism" (Sweetscr, 
1956) 

As seen in his paper m this solumc Sneetser (1968) also studied the 
factors involved in zones and sectors of Helsinki and Boston In an 
earlier paper dilTefcntial change in importance of factors in suburb and 
core also was studied for Boston 195l>-l960(Swceiser. 1964) 

In Copenhagen. Pedersen analyzed sets vvjih 14 variables each, re- 
ferring to 1950 and 1950 Three orthogonal dimensions interpreted as 
"Urbanization.” “Socioeconomic Status “ and "Growth and Change" 
were found to account for S4 to 92 per cent of the total variances of the 
matrices, partly because of a limited and somewhat redundant selection 
of variables (Pedersen, 1967) 

In a study of the Cny of Newark. New Jersey, six orthogonal (prin- 
cipal) dimensions accounted for 71 4 per cent of the total variance of 
48 demographic, social, economic, housing, location, and land use 

variables Aftervanmaxrotationlhcstrongestdimension wasinterpreled 
as a “Segregation” or “Racial Slum" dimension, another one as 
“Social Rank,” whereas two further dimensions were related to the 
middlestages of the family cycle and tentatively called “Residentialism” 
or “Famihsm" and “Middle Age " Two weak dimensions were labeled 
“Mobility" and “Density ” Factor scores were computed Within the 
central city the famili>m, social rank, segregation, and density dimen 
sions showed concentric tendencies, whereas sectorial tendencies were 
seen mostly m the social rank dimension but to some extent also in the 
segregation, middle age, and density aspects of the structure (Janson, 


Several factor analytical studies of the urban spatial 
cently been reported or are under way, for example, of Manhattan 
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(Carey, 1966) The Center for Urban Studies in London has been analyz- 
ing London and other British cities according to the 1961 Census of 
Population and Housing as could be read in Norman’s paper in this 
volume (Norman, 1968) The component analysis used in a preliminary 
step toward a multiple classification of urban areal units means a 
further development and an application on another problem of the 
classificatory techniques referred to here in the next section (see also 
Scott, 1964) The British Inter Universities Census Tract Committee 
has CO ordinated factor analytical and classificatory studies of other 
Bntish conurbations, such as Southampton, Merseyside, and South-East 
Lancashire (Gittus, 1964) 

The biggest Swedish cities are now being studied by the Swedish 
Building Research Institute, Stockholm 

1 3 Intercity and Regional Factor Analyses 
1 3 1 Studies Concentrating on Specific Aspects Just as some analyses 
concentrated on the intracity variation of specific variables, the variation 
of special phenomena between other areal units has been studied by 
factor analytical methods Perhaps Hofstaetter’s (1952) reanalysis of 
Thorndike’s Your City (I939)can be included among thist>peof studies, 
as Its 23 variables concentrated on the specific aspect of “goodness” or 
“moral integration” in intercity variation Of works using other units, 
Caltell’s study (1949) of nations may be mentioned 
Olsen and Garb (1965) selected ten American states with a relatively 
high level of per capita income, industrialization, and technologically 
advanced agriculture for a factor analysis of 49 variables, mostly of an 
economic character The analysis was compared with another one of the 
same variables for the ten states of the Amencan “South,” thus pro- 
viding a means for describing the South m comparison to the ten first 
“standard” states Five centroid factors were used in the compansons 
after quartimax rotation In the standard states “Urbanization” and 
Suburbanization” were the most important factors, of which the sec- 
ond one was virtually absent in the South There “Urbanization* and 
a factor representing the presence of a lower-income group of unskilled 
labor were the strongest factors 

Some regional factor analyses of the specific aspect type specialize in 
political behavior Recently the factor analytical approach has been 
tried out rather extensively in this field As non-Amencan examples, 
the nnnish studies by Allardt (1964a and b) may be mentioned There 
the regional distribution of Communist votes was analyzed with com- 
munes as units The communes were divided into five categories Cities 
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and towns constituted one category, whereas the rural communes made 
up the other four types those with Swedish-speaking population, 
southern and western communes eastern communes, and communes 
tn northern Finland For each set of communes 37 normalized social 
and poliijca! variables were analyzed Seven principal factors were ex- 
tracted in each case and rotated according to an oblique method 
invented by Ahmavaara and Markkancn Also, centroid factors were 
computed and rotated according to the varimex principle to new 
orthogonal factors The two solutions gase mutually consistent results 
In the North, but not m the other areas, Communist strength was found 
to be associated with social change and insecurity This result supports 
the View of "Backwoods Communism’ as a reaction to alienation 
(For Finnish regional studies see also Allardt and Riihinen 1966) 

In Sweden, Janson (1961, pp 2(6 f ) used the 28 Swedish constitu- 
encies in an analjsis of Social Democratic and Communist vote in 
1948 and 20 normalized social economic, and demographic variables 
by means of the centroid method Three factors were extracted and 
slightly rotated 

Original approaches arc provided by Alker, Jr (1964) and by 
MacRae and Mcldrum (1968) m this volume Alker analyzed roll calls 
in the UN General Assembly 1961-1962 separately for economically 
developed countries and underdeveloped countries and for all countries 
together (See also Alker, 1966 ) MacRac and Meldrum use deviations 
from election and county means in various Illinois elections as variables 
m principal factor analysis on a county basis to reveal “critical elec- 
tions ” 

1.3 2 Genera! Factor Analyses Another senes of studies, however, 
aim at a general descriptive and/or cJassificatory scheme for areal units 
with the factors as basic descriptive dimensions Often the units are 
cities, for which lacior analysis offers an alternative approach to the 
usual aprioristic, geographical, and economic classification systems for 
cities according to fiincliorja) criteria 
As earlv as 1942 Price amlyzed 15 variables for 93 cities Applying 
Thurslonc's centroid method, he extraeted four faclors. which afier 
rolnhon were interpreted ns “Populnhon Size.” ' Occupational SIruc 
tore” (service center if there is no industry). "Letel of hiving, and 


“Per Capita Trade Volume ” , 

Kaplan (1958) sampled 370 out of dSd cities with over 25.000 popula 
non ,n the United Stales Two analyses i»re made 
centroid method Fitst. 10 variables expressed in absolute 
lhat IS. not proporlions. means, or ratios -were analyzed toeether 
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vMth 22 vanables expressed as relative numbers Five factors resulted, 
the first of v>hKh was obviously a “Total Population Size” dimension 
Then “population size” and the 22 relative variables of the first set were 
analyzed together with 24 other vanables of the same relative type In 
addition to size the variables concerned demographic, social, economic, 
educational, locational, occupational, housing administrative, and 
other conditions Four factors rather closely corresponding to the four 
remaining dimensions of the first anal>sis resulted They were labeled 
‘Socioeconomic Status,” “Population Stability,” “Ethnic and Racial 
Homogeneity and Strong Residential Area Onentation,” and “Age- 
Sex Structure” dimensions Kaplan then went on to measure the dimen 
sions by means of simple scales with the highest loaded vanables as 
items, thus he could list the cities characterized by the various combina- 
tions of factor scores Many categones of the list showed marked 
regional homogeneity 

Hadden and Borgatta (1965) carried out eight analyses of 644 out of 
674 cities of the United States with 25,(X)0 or greater population in 
I960 The 65 vanables measured population size and changes, other 
demographic, economic, educational, occupational, housing, functional, 
locational, and regional aspects, most variables being percentages, 
means, or values per capita. One analysis covered all cities, but the 
alies were also divided into categories according to size or metropolitan 
status and separate analyses made for each category According to size 
the cities were classified as “town-ciiies” below 50,000, “small cities” 
between 50.000 and 75,000, “intermediate cities” of 75,000 to 150,000, 
and “large cities” over 150 000 Of 679 listed cities the categories con- 
tained 344, 150, 106, and 79, respectively The three remaining analyses 
concerned 246 “central cities” (that is, “principal political or adminis- 
tratively defined cities” in Urbanized Areas or Standard Metropolitan 
Statistical Areas of the 1960 census). 227 “suburbs” in such areas, and 
I7I “independent cities” 

In each analysis 15 principal factors, accounting for at least 90 per 
cent of the total variance, were isolated and rotated according to the 
varimax entenon Six of the dimensions were given parallel interpreta 
lions m all anal>scs, although there were some, mostly minor, vanations 
in loadings between analyses The dimensions were named ‘Socio- 
economic Status Level,” “Age Composition.’ * Educational Center,” 

Residential Mobility,” * Population Density,” and * Wholesale Con- 
cenlration Four more dimensions, * Total Population ” ‘ Manufactur- 
ing Concentration “Durable Manufacturing Concentration,” and 
an unnamed dimension pivoted by * Percent employed in public 
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adnijmstration,'’ uercidenttfied m all but one analysis, whereas ’‘Non* 
white, Commun»cat«w Cmtcr,’* and a dimension pivoted by 
Percent m high school” were missing in i«o analyses 
Finally the profiles of cities were represented as deciles on (wo com* 
posite indices and ten single vanaWes, selected as measures or part 
measures of dimensions or. in one case, as a theorcucally imponant 
variable, which was split into several factors 
Moser and Scott (1961) analyzed 57 variables for 157 English and 
Welsh administratively defined cities having at least 50,000 inhabitants 
in 195! Thevariablescoveredihc fiddsorpopulationsiacandstructure, 
population change, households and housing economic character, social 
class, voting, health, and education Four principal components ac- 
counted for 604 per cent of (he total variance They were mferprclcd 
as “Social Class,” "Population Change between 1931 and 1951." 
“Population Change between 1951 and !95S, ' and ‘Overcrowding” 
The cities were grouped after Ibeir factor scores into homogeneous 
categories This multiple classification resulted in 14 groups which 
contained all but two (owns London being one of them Three groups, 
totaling 36 cities were mainly resorts and administratire and conimer* 
cial centers, five groups, composed of 65 cities were mainly indusirni 
towns, the remaining sw groups held 54 suburbs and suburban-type 
towns (For a further development of the technique of multiple classi- 
fication by Howard, sec the paper by Norman in this volume J Berry 
(1966) mentioned a classificational study of Indian cities by Ahmad 
Factor analysis of countries and other areal units and subsequent 


multiple classification has been used to delineuc regions of various 
kinds (cf Berry, 1966. for a review) From a purely technical standpoint, 
however, factor analysis is not necessary to precede a multiple classifi- 
cation but may be used to facilitate the grouping or for theoretical 
reasons An interesting variant of analysis employed to delineate regions 
IS to use the n(/i - J) pairs of areal units as cases and let the variables 
measure contacts, flows, or interactions between units (Bcrty, 1966) 
Basic dimensions fiav c 6«n eftfsmVJitWf 6j’ of fjrtcv aiwlysis 
also for units other than cities Gouldncr and Peterson (1964) analyzed 
59 variables from the Yale Cross Cultural files for 71 pnmitive or 
preindustnal societies They presented fO orthogonal and II oblique 
factors, four of which m both sets were, according to eapceiations. 
called “Linearity," "Sex Dominance," "Level of Technology, and 
“Apolloniamsm" or "Norm-Sending " They also made a second-order 
analysis of the oblique factors and discussed causal pnondcs of Lev el 
of Technology" and “Apollonianism" especially. 
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Counties were analyzed by Jonasscn (1961), who used 82 variables 
describing Ohio’s 88 counties He brought out seven orthogonal factors, 
accounting for 73 per cent of the total variance He named the factors 
“Urbanism,” “Welfare," “Influx” “Poverty,” “Magni-Complexity,” 
“Educational Effect," and “Proletarianism ” 

Communes were used as units by Rnhincn (1965), who analyzed 26 
normalized variables for the 548 Finnish communes He changed the 
set of variables somewhat to test the invariance of solutions Principal 
factors were used and rotated to two separate orthogonal solutions In 
one of these, the first factor was rotated so as to be best interpreted 
according to the theoretical scheme of the study as “Industrialization” 
or “Division of Labor ” The next three factors were rotated according 
to the vanmax principle The three factors were labeled “Propensity of 
Economic Disturbance,” “Changes in the Per Capita Income,” and 
“Soaal Disengagement” In the other solution the vanmax principle 
was applied directly on the first five principal factors, which were then 
interpreted as “Efficiency of the Population,” “Pressure toward Con* 
formity," “Narrowing of Differences in the Per Capita Income,” “Ex- 
pansiveness," and “Efficiency of Organization” or “Centrality ” Factor 
scores were computed for both sets of factors and mapped In another 
study Piepponen (1961-1962) analyzed Finnish communes 

In Sweden a pilot study of a small sample of 51 communes was 
analyzed for three sets of variables Two sets of 45 and 44 had their 
variables paired to make them as similar as possible and had five 
variables in common, whereas the third set contained 73 variables, 38 
from the first senes, 30 from the second one, besides the five variables 
held in common The variables were analyzed both before and after 
normalization In the larger set 9 factors were extracted, in the others 
10 factors Six dimensions were identified m all six analyses, one more 
in five of them, and still one more m all nonnormalized sets (Janson, 
1965) 

Another study aimed at a descriptive scheme for blocks (and build 
mgs) as depicted on maps of detailed urban plans (Fog, Janson, and 
LanesjO, 1965) 


1 4 Summary Description 

Factor analysis has been applied in human ecology both to intracity 
variation and to regional and intercity variation On both levels specific 
aspects have been approached the emphasis within cities being on 
delinquency, crime, and other social problems and on political behavior 
The latter is often treated on the regional level More general analyses 
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of urban spatial structure often aim at a descnp(t\e scheme, as do ihe 
general analyses of cities and regional variation There also a classifica 
tory system of some kind is often Ihc purpose of the analysis 
Technically, the analyses use as vanables mostly averages, including 
percentages and quotas Absolute numbers occur as variables in some 
cases Often amlyses now include large sets of variables which were 
not possible to handle before modern computers were available 
Prmcipat factors now arc frequently computed even for Urge malnccs. 
However, centroid factors still occur Rotation to new, mostly orthog- 
onal, axes IS common often according it> Ihc varimax principle There 
arc several cases of oblique solutions Sometimes factor scores are 
calcuhtcd occasionally by rather primitive methods for further study 
Asa rule the intcrpreiaiionsarc on a descriptive level and the analysis 
aims at generalizations essentially asa kind of empirical concept analysis 
to give basic descriptive dimensions that is the ’factors” are not 
interpreted as causal factors behind the manifest variables but as 
dimensions in terms of which a structure can be described By selecting 
as variables what might be considered partial measures Ihe dimension- 
ality of one or a few concepts is studied 


; s^s/c ^ssi/^//’r/0A^s of tub factor analytical 

APPROACH 
2 I Paste Set of Lquaiions 

Factor analysis is ba<:ed on a set of well known equations that imply 
cerwin assumpljons cooccrnins ihc variables in be analyzed Tech- 
nically there arc severatway! of arrivine at solutions with regard to Ihe 
parameters of this model These techniques may impose additional 
assumpl.ons, because they may apply to special cases only of the 

general model . . . , 

Naturally the application of Ihe basic model poses certain technical 
dilTiculties.as the assumptions most fit the actual ilala reasonably ivcll 
both theoretically and operationally 
It should be noted that factor analysis is riiscussed here, except in a 
part of Section 2-1 as a descriptive tool not as a causal model 
Here as in the previous section descriptive includes ‘inferential ‘ 
as lone as Ihe gcnerahMIitrns aimed at in the analysis are noncausal 

(cf Hartley, 1954) . . .v „ 

The basic equations of factor analysis and their corollaries may be 
compactly and elegantly staled in terms of matrix algebra, which also 
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has the additional advantage of not being easily understood by every- 
body, and thus of giving them almost as high a scientific status as 
differential equations However, the points to be made seem clearer, at 
least to me, if the basic equations are simply stated in termsof elementary 
algebra 

Thus the basic set of equations may be formulated as 

p 

Xikt - S °uffk + (1) 

/ -1 


where 

(fl) means the value for unit k at observation t on the manifest 
variable Xt, with i = 1, 2, . , w, and A: = 1, 2, , N 

(b) means the valueforunitA onacomwiofi factory}. Withy = 1.2, 

, p, and p being not greater than m and the concept of “common 

factor" applying to all factors occurring m more than one 
variable x, 

(c) j,* means the value for unit A: on a factor, or latent variable, Si 
that IS specific to variable Xj 

(d) means the value for unit k at observation i on a random 
component Cj of the variable x„ varying between observations 

(e) a,/ is the weight of factor f, in variable x, , bi and c, are the weights 
of 5, and respectively, in variable X| 

Usually the numberp of common factors is, at least implicitly, assumed 
to be only a small fraction of the number m of observed variables, 
although it might be recognized, for example, in the principal factor 
modet that formally tbere are as many Szeiors as Jhwe are varjiahJes 
All random components arc always assumed to be mutually independent 
and independent of all other factors, and all specific variables are always 
assumed to be mutually uncorrclated and uncorrelated to all other 
factors The common factors arc assumed to be uncorrclated in orthog- 
onal models but may be correlated in oblique models Some models 
contain no specific ractors,that is, the fci-coefficients are all nought, and 
the principal factor model has neither specific nor random factors (For 
a general description and discussion of factor analytical models, see 
Harman, 1967, and Lawlcy and Maxwell, 1963, for a discussion of 
applications of factor analysis, see Henrysson, 1957 ) 

For simplicity, all variables and factors are here assumed to be 
standardized, that is, to have mean 0 and standard deviation 1 If so. 
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Z + 6/ + cr = 




Ihai^rihr *^'"E Ifie of the vanable 

by the variation of the common factors 
The correlation bctivcco the vartaWe v, and the factor ff is 

fly + Z ‘»J»O,/,07^ 0 (3) 

anti the correlation between the variables Xt and x„ is 


Z^ffiA<+ Z Z ‘>iAvr/,i^07*'.'=^») f4) 

The basic set of equattoos (I) obviousty treats each variable as a 
function of a set of factors More specifically each variable is a weighted 
sum of factors that are seen as underlying or latent 


2 2 Relation beiv^een Factors 

The common factors, as just mentioned, are either allowed to be 
correlated, in the oblique models, or assumed to be uncorrelafed, m the 
orthogonal models The assumption of noncorrelaiion may be sharp 
cned into an assumption of independence in some orthogonal models 
Theuscof orthogonal dimensionsjsvcrywidespread The orthogonal 
factors scent rather often to be taken, at least implicitly, to be not only 
uncorrelated but also independent It has sometimes been argued that 
the assumption of independence (or, presumably, also of noncor relation) 
of factors IS principally unreasonable (cf Coleman, 1964 pp 20 f) 
Admittedly it certainly appears that measures of many theoretically 
important concepts should be correlated Thus we frequently seem to 
have a choice between using oblique dimensions and splitting the 
corresponding concepts so as to obtain uncorrelated (or perhaps 
independent) dimensions, transferring common components of the 
concepts to one dimension and keeping the remaining parts as uncor' 
related dimensions The important questions then are first, whether or 
not this gives us meaningful and inlerprctable conceptual units and. 
second, whether this provides a useful and illuminating analysis of the 
concepts It cannot be decided on an a pnon basis only that the second 
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altmuon A *p!ora6!t (,a6„ „f j„s, „5,„j 

J'lilabic lo see «hit mil happen puls the fimous GIGO (carbase 
m girbigc Dill) pnnoplc into force ind lias probably contributed 
to -i certain disrepute of factor analysis 
A careful selection of variables will improve the chances of useful 
results considerably In pretests or oilier cvpioratnry stud cs the selec 
tion will necessarily be rather lentatae but otherwise no variable should 
be included except for special ibeorciical or perhaps technical reasons 
If definilc Itvpofhcscs arc n n icstcd implicit fivpoihescs of relevance or 
importance of certain fields or aspects to be covered may be followed 
If possible the number of variables from the various fields or aspects 
should reflect however vjci civ notionsofrcbtiveimporianceoffields 
and aspects for the problem stud cd 

In selecting variables the problem of reilundanc) often appears that 
IS some variables become related from technical or otherwise trivia! and 
irrelevant causes This may hippen because of careless selection and 
construction of variables a I mited supply of data or a very detailed 
covering of a field 

Most direct redundancy results from one variable oHen constructed 
as a percentage or mean being literally m a technical sense contained in 
another variable If n is the number of cases belonging to the /th 
subclass out of a total of <7 subclasses together having // members one 
variable may he «,/A' and another one (n, + nj)//V Or the first variable 
may be the mean of the first subclass the second variable the mean of 
(he first and the second subclass IfJo there is evidently a correlation 
between the variables if not confounded by some peculiar compensa 
lory mechanism between the two subclasses The remarkable result will 
be the two variables not having a common factor rather than the 
opposite CISC Examples would be Manufacturing establishments per 
capita and Manufacturing establishments with 20 or more empiojees 


per capita 

Sometimes (he relation is partially hidden vvilh the second variable 
ilelinetl as I - (n, + or nJN T”! + 

the tlifTcrcnce is almost invanani tfie two variables vi/l have a slrong 
negative correlnl.on Thus there ire good reasons lobe rather restrictive 
in mclucling viriables o/tV from the same classilicationil system 
simultaneously If two or only a few vanaWes from such a senes are 
used not more than one of them should cover a big and varying 
proportion of cases and ihc proportion of cases left out should te big 
and varying If several variables from the same series are employed 
they may cover all or almost all eases together but not more than one 
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Single variable should alone cover a considerable proportion of cases 
Examples of this kind of variables arc “Percentage 0-15 years of age of 
population," “Percentage 15-2! years of age of population,” and 
“Percentage 65 years or more of population “ 

The redundancy is less direct and, as a rule, smaller if one variable 
has as numerator a subclass of another variable’s numerator but also a 
different denominator, for example, the variables “Percentage of 
economically active females of all females 15 years or over” and “Per- 
centage of economically active married females of all married females ” 
Another indirect, and somewhat less technical, redundancy occurs 
between variables with the same or almost the same denominator and 
numerators from different but actually closely related classifications, as 
between“Percentage of males with college education of all males 25 years 
or over” and “Percentage of professionals of all economically active 
males ” Here the two variables seem to have a common element more 
restricted to the two of them than, for example, a factor of social rank 
(see next section) 

Factors with considerable redundancy in their highly loaded vanables 
will appear, to a corresponding degree, as "artifactors," to use the 
psychologist Charles Lcef’s expression Analyses with many of these 
factors cannot, of course, claim much interest 
On the other hand, factors that may appear to have an artifactonal 
component may turn out to have a substantive core and remain when 
the redundancy is eliminated or greatly diminished Some factors of the 
large matrix of Schmid and Tagashira may serve as examples Another 
can be taken from the study of urban detailed plans by Fog, Janson, and 
Lanesjd In the first analysis two variables describing the character of 
the border of the block were found to be much more strongly correlated 
than expected They were constructed as percentages from the same 
classification and with the same base, and the remaining categories 
were smaller than expected The variables defined a dimension of 
“Border Characteristics,” which appeared to be a genuine artifactor, 
although there were theoretical reasons for a “real” dimension of this 
kind In a replication using new data the definitions were changed to 
reduce redundancy by breaking down the categories into more detailed 
ones, but a dimension of “Border Characteristics” still resulted, 
although slightly changed 

Also, redundancy of (he last-mentioned indirect kind differs only in 
degree from substantive and interesting connections between vanables 
Thus the distinction between redundant and genuine relations is some- 
times diffuse 
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2 4 Common Factors 

Because each variable is treitcd as a weighted sum of certain factors. 
Its variance is formall^f accounted for the variation of the factors, as 
seen in Equations 2 The correlations betw cen variables are correspond- 
ingly generated through various degrees of similarities in composition 
of the variables when the factors vary, as seen in Equations 4 
As much as possible of the information contained in the n x N 
matrix of data is transformed (o a smaller p x A' matrix of factor 
scores The very essence of the anal)sis seems to be the substitution of a 
smaller set of factors for a large set of variables The variables thus are 
treated as individual items reflecting vinous aspects of a system The 
focus of attention is transferred from the variables to the aspects, or 
dimensions, of which the variables serve as partial measures 
Some ecologists, honever. maintain a k«n mlcrcsC in the special 
varnblcs and so arc not particularly inieresied in ihe reduction of data 
to some basic dimensions As ihcir oriemaiion seldom makes them deal 
With very large sets of variables simultaneously there is rarely reason 
for them to indulge m factor analysis. 

Nevertheless, (he common factor model has been challenged as being 
not only uninteresting but also misleading The correlations between 
ecological variables are generally not to be accounted for by common 
factors, even when the variables are clearly clustered, but by causal 
relations between the variables themselves in pairs, circles, or chains 
Even here the interest in particular variables is retained, and the appro- 
priate technique of analysis would be some variant of path (Wright, 
1934) or dependence (Simon, 1954, Blalock, )96]) analysis (Otis 
Dudley Duncan and Beverly Duncan are prominent exponents for this 
line of orientation Duncanand Duncan, 1955 and 1960, may be seen as 
typical approaches by them, though not demonstrating path or depend- 
ence analysis as in Duncan, 1966 ) 

As a linear causal model, factor analysis is usually conceived as 
staling that a system of causally related or unrelated factors influence a 
set of variables that arc not directly related among themselves Such a 
model may seem rather special and not to have as wide applicability m 
human ecology as perhaps m mental testing or altitudinal research 
(See, c g , Blalock, 1961, pp 167-169) 

This conception of the factor analytical model appears to be some- 
what unnecessarily restricted If causal relations are assumed not 
between the variables but between the variables with the random 
elements removed, such relations seem to be permitted in certmn 
patterns within the model, the relaUons going from layers of variables 
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closer to the factors to more remote la>crs of variables However, these 
relations cannot be estimated by the usual factor analytical methods but 
are included in the loadings of the factors Thus the weights reflect both 
direct and indirect influences of the factors on the variables in a more 
complicated model with increased flexibility but also with correspond- 
ingly increased difficuUics of interpretations Even so, it has to be 
admitted that the factor analytical causal model, thus expanded, wnU 
have limited applicability only, as most special models have 

Factor analysis may be useful, however, as would cluster analysis, 
both technically and theoretically, in order to provide a preliminary 
grouping of variables in more or less closely connected systems, the 
causal structure of which could then be further explored by means of the 
approachesjusl mentioned Also, the factors may be entered as variables 
in a causal system (Cox, Ch 13 of this book) In fact, this seems very 
much in line with the mam stream of factor analytical research, which 
aims at developing sets of basic descriptive dimensions These may be 
valid regardless of the various possible causal processes underlying them 
if they are not vested with causal meanings Furthermore, examples of 
causal interpretations of ecological correlations given in terms of 
specific variables instead of factors are not always convincing 
First, the manifest variables are generally seen as fairly reliable and 
valid measures only of the theoretical variables between which the 
proposed causal relations will hold, so actually the relations are between 
components of the variables Second, the relations sometimes refer to 
the individual rather than the ecological level Thus they may be at 
work for the individual variables, whereas the ecological variables are 
complicated through additional processes making them react more as 
measures of some other factors, in other words, the distinction between 
individual and ecological correlations is not upheld Third at times the 
statements seem to be unduly simplifying a rather complicated situation 
Three examples may illustrate these points, the first and third of 
which arc involved in all of them, though the second point is relevant 
only to the second illustration The examples are not intended as 
elaborations of a position that the factor analytical model always or 
usually IS superior to all other models On the contrary, the position 
taken is still, as was earlier stated that the factorial model of basic 
dimensions on a descriptive level profitably may be pursued by other, 
causal models of the particular problem analyzed 


1 In discussing the study by Gouldner and Peterson Stinchcombe 
(1964, p 550) concentrated on the variable Agriculture It must have 
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a direct causal connection \vilh the variable “Use of Gram for Food “ 
It IS also the cause of three factors namely * Level of Tcchnolog)* and 
factors described by “Permanency of Residence” and ' Private Property 
in Land," rcspcciivcl> 

If one considers (he measuring of variables m four-point scales from 
dominance or marked elaboration to absence the rather low correlation 
of .59 between the two variables, and the possibilities of ujmg gram as 
object of trade and perhaps also food for domestic animals, it is not 
entirely unreasonable that approximately half of the correlation between 
“Agriculture" and ‘Use of Gram for Food' should be accounted for 
by the technological factor With regard to the relation between 
“Agnculturc" and the three factors it is an open question whether 
Slmchcombe's point of view is more plausible than Gouldncr's and 
Peterson s or vice versa 

2 Instead of introducing a socioeconomic dimension to account 
partially for correhtions between educational occupational, income, 
and bousing variables education may be considered to lead to certain 
occupations and through them to certain levels of income, which makes 
It possible for some people but not for others to have a house or rent an 
apartment of a given value and size 
Evidently this point of view has merits on the individual level 
However, it is very doubtful that the ecological correlations between (he 
corresponding ecological variables can generally be derived from such 
simple processes on the individual level In urban areas with many 
highly educated people not only these people will provide the area with 
a gre. 1 t number of high-income families, but also other wealthy families 
will be attracted by the same qualities of the area, etc 
3 In (he study of detailed urban plans one dimension was interpreted 
as “Block Size ” Then the variable “Block Area” might be suggested to 
generate the other variables of the cluster such as Number of Buildings” 
and “Length of Border “ However, ‘ Block Area is no more an 
independent variable than are the others as the size and shape of the 
block are often manipulated by the planner rn designing the area The 
variables arc thus simply interrelated aspects, among others, of the 
solution of a planning problem (Fog Janson, and LanesjO, 1965) 


2 5 Defimtton of Factor 

The concept of “factor* m the factor analytical sense is operationally 
determinetl Ihmagh Die roalhematical procedures m compolmg the 
rve, gills and scores orUie factors It has bcci. argued that factor analysis 
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IS unfit for delecting basic dimensions of a structure because its factor 
defintuon is so broad that ‘ factors" will alwajs result, so no test is 
provided of the fundamental problem, whether or not there arc any 
common features of the variables studied (Coleman, 1964, p 20) 

This statement needs some quabficalions \t is bteiaWy true tVial some 
factor or factors will always result from a factor analysis Nevertheless, 
the outcome may lead to, or make the researcher inclined to, a con- 
clusion of no useful onmportant basic descriptive dimensions Ifasct of 
randomly generated variables is analy 2 cd. some factors will certainly 
emerge But if the observed correlations are based on a reasonably 
large number of cases, the factors will be weak (unimportant) in terms 
of the proportion of the total variance accounted for by their variations, 
that IS, the communaluies will be low, indicating that most of the 
variation of the set is uncorrelatcd with the dimensions, unless we take 
out a great many dimensions, thus losing sight of the main purpose of 
the analysis Also, the factors, or their weights, will be completely 
unstable from one sample of data to another Furthermore, even if we 
gel fairly important dimensions in a set, importance still being measured 
in terms of variances accounted for. wc may not be able to make any- 
thing out of them, asweeannot interpret them They may be inlelligable 
after rotation, but as long as we have not found any such rotation, the 
anal>sis has not given us a set of basic dimensions (see Section 2 6) 
Of course it is not a necessary consequence of the calculations involved 
tn the analysis that the dimensions can always be interpreted 
On the other hand, it must be admitted that the concept of factor in 
the factor anal>tical sense is rather weak, not compared to the concept 
of a unidimensional scale according to Likert or Thurstone, for example, 
but compared to such a scale according to Gutlman As mentioned, the 
principal factors of a set of items with a perfect cumulative scale are, by 
definition. Gunman's principal components If items from two correlat- 
ed scales arc analyzed together, the two sets of principal components 
will not emerge as distinctive principal factors If items measuring the 
‘ intensity of the scale arc analyzed together with the items of content," 
the resulting principal factors wiU blur the content and the intensity 
somewhat If the principal factors of a perfect cumulative scale arc 
rotated, for example, according to the varimax principle and factor 
scores arc computed for the new dimensions none of these will rank the 
cases perfectly in the order of the original scale (Riley and Janson. 
forthcoming study) 

As It IS a reasonable demand of a technique for discovering basic 
dimensions that it be able to detect possible perfect cumulative scales 
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involved the staled pcculiante of factor analvas nilh retard to 
perfect scales may be seen as more or less serious disadvantages 


2 6 Nmmtqueness of Sohidons 

As has been pointed out many times the factonrafion of the variables 
m bquations 1 js not umque Any linear set of functions 


X ^/ai5«rlt (5) 

transtorms the variables to wcightcti sums ofthe new factors with the 
same qualities as the Equations I if theg„ factors are standardized and 


«?i = I (» ?) (6) 

The comntunalities wUi be (he same as before Some solutions nmII be 
preferable compared to others according to various criteria but 
generally no single solution will stand out as (he only possible or best 
one 

This nonuniqueness of faclonzaiion bis often been held out as a 
weakness of factor analysis Clearly it reduces ihc value ofa negative 
result If an expected set of dimensions docs not become esident it 
might emerge after some rotation (5) 

On the other hand the nonuniqueness may also be seen as a sign of 
fundamental soundness of the method or at least as ba\jng useful 
aspects Methods of mathematical and statistical inalysis may often 
conveniently be looked upon as formal and systematic forms of 
ordinary everyday procedures to draw inferences of the external world 
from empirical though perhaps informal and iins)stematic observa 
tions 

If this reasoning is applied to factor analysis the nonuniqueness of 
solutions reflects the important nonumquenessofcheoncsandframcs of 
references If one such conceptual sebetne is successfully applied to a 
matrix of data this of course does not mean that no other schemes will 
be applicable or fit the data equally well as they often are at least 
partially transferable and able to explain similar phenomena but m 
dilTcrent ways This principle then comes out clearly in factor 
analysis through the infinite number of possible solutions, serving as a 
practical lesson m scientific tolerance 
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3 BASIC TECHNICAL ASSUMPTIONS OF FACTOR ANALYSIS 
3 1 Le\e! of Measurement 

3 1 I Kind of Variables Equations 1 and the definitions used there 
imply that each variable, as a weighted linear sum, is measured at the 
level of interval scales at least This, of course, is a very important 
assumption Hov.c\er, many variables in ecological theory arc defined 
and used on an ordinal level only, as the reasoning is mostly verbal and 
often rather vague A whole set of possible operational definitions of a 
theoretical variable may give the same rank order of areal units, because 
any measure in the set is a monotone function of any other measure 
To avoid the assumption of interval scales, a nonmetric (ordinal) 
analysis may be used In the selection of an alternative to ecological 
factor analysis it is advisable that the quantitative character of the 
variables be preserved Ifihe variables arccompressed mtoafew ranked 
categories, information is lost, and severe new problems are introduced 
in choosing more or less arbitrary cutting points 
When the original variables themselves are dichotomies or a few 
ranked categories, such as administrative type of communes or roll-call 
vote of countries in the UN assembly, it will be easier to find alterna- 
tives to factor analysis among the scaling models of social psychology, 
for example, latent structure analysis or Guttman's cumulative scale 
analysis (cf Torgersen, 1958) However, the question of level of 
measurement docs not apply to dichotomies In ecological factor 
analysis the question of level of measurement arises mainly in connec- 
tion with two types of variables The first type is the ordinary percentage 
or some other average For instance, the “Sex Ratio” is often used m 
theoretical contexts in such a way that a valid measure on rank-order 
level must arrange the cases according to the proportion k = Wj M (not 
to be confused with the summation index k used m the set of basic 
equations) of women to men in the adult population There may be 
some doubts about what age groups the measure should refer to If this 
point has been settled, then all monotone functions g(jV) produce the 

same (or exact oppo^tc) rank order as for example, 1/*, A:/(k + 1). 
log k. e>‘, Vk. I/vT k^k + !)^ or vT/vT + T But they all 
give different frequency distributions Usually ecological theory does not 
demand one particular measure g{k) In that case, there is no reason 
why any g{k) should be considered a measure on the interval scale level 
if there arc no other criteria to apply Such possible additional criteria 
may concern, for example, parsimony the frequency distribution, or the 
relations to other variables Some function g{k) may perhaps have 
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linear regressions to most other rdesani sanablcs and so be preferred 
lo uxQsc iwtn cursihncar regressions 
For Ihc scyalro. either the lanable H'/(4f + II’) as a simple measure, 
not so Skewed as A and possessing definite upper .and lower bounds, or a 
monotone transformation of X or any monotone function g(l) to a 
normal!) distributed \ariate will show many preferable qualities 
Usual!), howescr the choice octween the many possibilities of 
»ntcr\al scales that are equivalent on the ordinal IcicI seems to be 
largely ignorcil Percentages are often used more or Jess routine!) 
The correlations between the possible variates on the interval level are 
general!) far from complete because of the nonlinear relations between 
Ihem The choice is thus possiWj far from unimportant 
Another t>pc of variables is much more in^requeni and usually 
introduced in a set more reluctanily namely an ordered set of categories 
with arbitrar) values, usually consecutive integers, attached to the 
categories Tojustify such a varnWe as an interval scale the numbers 
should be assigned so as to give equal intervals*’ in some sense, as 
through some judgment technique like the method of paired compart* 
sons with experts as judges Insuchacase the estimates of scores wil! 
besubject tosomeerron and the simpleseries of integers maysometimes 
he considered an approximation to them 
If the set IS not too long and the order of the categories is undisputed, 
the dilTercncc between the senes of true scores and the senes of arbitrary 
integers may be unimportsnt with a correlation sulTiciently high to 
justify the use of the latter as an approximation of the former The 
parallel between the arbitrary scoring of alternatives of items on a 
Likcft scale and scoring by means of normalization is obvious. 

In the study of detailed urban plans by Fog, Janson, and Lanesjb, 
buildings were classified m six main types according to form of floor 
area and the types ranked after degree of "closedness” or ‘sheltcred- 
ness " The decision on order was unanimous among the experts con 
suited The types were assigned numbers from 0 to 5 Various sets of 
alternative scores considered reasonable instead of the scores J through 
4 together with various assumptions of distribution of cases gave 
correhlions to the integer series of 90 or more in most instances The 
integer senes was then considered a rather satisfactory approximation 
to a wider set of variables among which the "true scores variable' 
probably could be included 

The average score on 'shcltcredncss” for the block was then used as a 
variable in the analyses Another vanable of the same type, expressing 
“openness” of grouping of buildings, was also used with less 
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jusuficaUon, as lU correlation with aUcmali\c scoring systems was not 
studied although the senes of types was longer 
The tentative conclusion, however, is that variables of the ordered 
categories type can sometimes be treated on the interval level at least 
as well as the usual percentage or mean variable This is probably 
contrary to the opinion of most experts and thus somewhat question- 
able 

Nevertheless, whatever reason is given for selecting a particular 
measure from a set of possible measures, the selected variable must be 
treated as an interval scale 

3 1 2 In\ariance under Scleciion of Variables The procedure of 
selecting and defining variables for a study can be seen as a process of 
gradually approaching a degree of specification necessary for the actual 
calculation of values on the variables It may take place m stages or 
steps, with the making of inventories and lists of increasing exactitude 
It starts with a list of the general fields or aspects to be covered Un- 
fortunately, this list often remains implicit Within the fields or broad 
aspects more specific aspects arc delineated, and this leads to a general, 
theoretical definition of a variable Some problems of the process so far 
were outlined in Section 2 3 Here we arc discussing the final step of the 
procedure, the selection of an operational definition on the interval 
level from the class of such definitions possible under the chosen 
theoretical definition, which usually specifics an ordinal measure at 
most 

If different interval scale definitions lead to practically the same 
systems of correlations, because they arc themselves very highly cor- 
related or for other reasons, then the choice between them literally makes 
no difference If various operational definitions do give different 
matrices of correlations, the choice between them evidently may be 
important However, it does not necessarily influence the results 
decisively 

Suppose a change of variables means a rather regular and propor- 
tional shift of the general level of correlations in corresponding arrays of 
the correlation matrix rather than irregular and spotwise alternations 
Then perhaps the differences in results will be limited to changes in 
communalities with the compensatory changes in loadings of the 
common factors rather proportional If these changes concern the 
whole matrix or if they arc restricted to certain arrays but limited in 
size, the number of factors may be constant their relative importance 
approximately unchanged and the interpretations roughly invariant 
When variables are selected according to theoretically relevant 
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criteria, il js inicrestine to see what consequences various choices will 
have When the choice is a tecbnicai question, it ought to have pre- 
dominantly favorable technical effects, such as enabling the researcher 
to reach a conclusion otherwise not obtainable On the other hand, 
when one disregards changes m technical efficiency, the effects on the 
substantive results are artifacts They should be discernible or slight 
After due allowance for them the remaining substantive results must 
show considerable invariance under various technical solutions m the 
sense that such solutions must not lead to incompatible conclusions 
about the subject studied 

When new variables are added to a set of variables, they create 
possibilities for additional dimensions Jf they introduce new fields, the 
dimensions may reasonably be substantively unrelated to earlier 
dimensions If they cover essentially the same aspects as the original 
variables, they make possible further distinctions among the old 
dimensions, dividing them into subdimensions and perhaps slightly 
regrouping a few of them 

If the stress of the variable setup is different because the problem 
studied differs, then (he results of the analysis will naturally be 
different On the other hand, if the sets are mainly parallel with 
differences consisting mostly of exchanges or additions, of equivalent 
variables, the substantive results should be mainly parallel though 
more detailed m case of an enlarged matrix at least when possible 
expected artifacts arc eliminated Otherwise, factor analysis will not 
be a useful research tool 

This invariance under sefcciion and definition of variables is expected 
on various levels down to the purely technical one of deciding on 
interval scale measures of given ordinal variables 
The invariance of findings over parallel sets of variables of various 
sizes has been studied m some investigations (Schmid and Tagashira, 
Sweetser, 1965A, Janson, 1965) The results point to a considerable 
invariance The invariance over parallel set of variables with size of set 

constantwasaIsostodied(Swcetser,I965a.Rnhinen,I965,Janson.I960. 

1965) and Kaplan compared sets mtb and » about i&r/ables expressed 
in absolute numbers included Here the analyses also showed remarkable 
mvanance Alternative sets of ecological variables have not yet been 
analyzed extensively More studies ofthis kind should be made although 
experiences of factor analysis in other disciplines where it is better 
established especially psychology, should be consulted, naturally with 
due consideration taken of the conditions that may be special to human 
ecology 
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Finally the possible invariance on the most technical level, that is, 
when choosing interval scale measures of theoretical ordinal variables, 
IS even less often studied Nevertheless, it is a problem well worth 
studying Very likely it contains aspects special to human ecology, 
thus the possibility of drawing from other fields of factor analysis 
decreases One aspect of the problem will be dealt with shortly in 
Section 413 


3 2 Variables as Linear Sums of Factors 
The fact that thevariables are seen as weighted linear sums of factors 
of course, excludes nonlinear and nonadditive functions of the factors 
as variables, insofar as they are not transformed to at least an approxi- 
mately linear sum 

A linear sum may, m some instances and in certain intervals, approxi- 
mate a more complicated function fairly well Functions of higher 
degrees than the first, exponential or logarithmic functions, and products 
may be transformed into linear functions, for example, the logarithm 
of a variable can be substituted for the variable itself in the case of a 
product of various degrees of factors A higher degree of a factor may 
come forth as a separate factor if factors are not independent 
Nevertheless, it must be admitted that factor analysis can handle 
directly only a limited class of functions Thus interactions between 
factors arc difficult to take into consideration 
Furthermore, m the basic set of Equations 1 it is assumed that the 
weights are the same for all values of a factor Thus the case where 
positive values on a certain factor arc important to a given variable but 
negative values are not is not included Nor are the cases where only 
numerically large values of the factor influence the value or where the 
weight of a factor is a furvclioa of another factor 

Under such conditions the estimate of a weight may be seen as an 
estimate of a kind of mean weight Compared to ‘ true ’ weights in a 
given interval of the factor, the estimates may thus be deflated or inflated 
Furthermore, other degrees than the first of the factor may be intro 
duccdifdcpendcnt factors arc permitted Also weights of other factors 
ma> be influenced This further underlines the need for careful selection 
of variables Again, the problematic assumption might be avoided by 
choosing another method of ana1>sis (sec the discussion of nonadditivity 
in Alkcr, 1966) 

NViihm the realm of factor analysis, there are a few steps that may 
improve the situation somewhat First, when a variable has a mulli- 
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plrcatlve factorial element, it may sometimes be eliminated and the 
residual, approximately addmve element of the variable included in the 
IS o variables instead A residual factor anal) sis md.y be !i]so 
or other reasons (Tukey, 1962) Second.jostasananaJysisorvariance 
can be broken down in subanalyscs when the interactions arc too many 
or too complicated for easy interpretation a factor ana!>sis may 
profitably be divided into several analyses The variable or variables 
for which interactions or varying weights arc detected or suspected are 
used to divide the cases wto subc}asses, andananaiysists made foe each 
class Interesting differences may be revealed and the assumption of 
linear sums may be upheld The resulting matrices and patterns can 
be compared by inspection or more formally analyzed for example, 
by means of Ahmavaara's and Markkanens transformation analysis 
(Ahmavaara and Markkanen, 1958 pp 80-S8) 

Hadden and Borgatta divided the cities according to size and 
“metropolitan status * Within the general tendency of similarities be- 
tween the various classes of cities they also found interestingd/ffcrenccs 
For instance, for “Large cities and Independent cities'* the “Non- 
white” factor was fused into the factor of 'Socioeconomic Status Leicl," 
which was stronger for these cities than for the others For all cities 
there was no “Size" factor but for each sobcategory such a factor 
appeared 

Fog, Janson, and Lanesjd made analyses of detailed urban plans 
separately for plans concerning the rebuilding of central blocks and for 
plans of Suburban or new blocks The latter had a “Tallness ’ factor, 
which did not appear for the former Allardt made separate analyses of 
political behavior, particularly Communist strength, in five categories 
of Finnish communes and found differences between Backwoods and 
“Industrial’’ Communism 


3 3 Uniform Weights of Variables 

Another important assumption of the set of basic Equations I is the 
uniform composiUon of a srvon larriifc Em* vonoWc *as rtr same 
compos, lion for all an, is Tfiis means lliar ihc randnm componcm has 
the same vscighl in a variable for all areal unils As a rule, the variable is 

afunction.usuallyamean.suchasapcrcentagc of' indivtdua values 

It contains a random clement even if all ’ individual units (persons, 
families, dsvellmfs. buildings, enlopnscs. Me) nilbin Ihc areas are 

First, there is a random element m the mechanisms b) which 
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individual units, within given categories of units, are located in a given 
area Second, there is often a possibility of observational errors in 
the observations 

The size of the random element in the ecological variable depends 
on the number of individual observations on v. hich the values are based 
But this number usually vanes between areas, and il is often impossible 
to lower this variation below a rather high level Furthermore, the 
random element in the values may vary between areas also because 


areas differ m degree of homogeneity 
As a rule, the number of observations and the degree of homogeneity 
cannot be kept sufiiciently constant between areas to make the random 
element of the value on the ecological variable the same in all areas 
Instead of a uniform weight c, for the random component of the rth 
variable, a senes of weights c,v for each case will become operative 
This will generate corresponding variation in the other weights as the 
weights arc connected according to Equations 2 The anal>sis will 
provide estimates of mean weights c, etc , in Equations 1 
A general solution to the problem of a^varying random clement of 
given variables will then be to make the relative importance of the 
random component sufficiently small for the mean weight c, to be a 
good approximation to the weights Evidently, a great relative 
variation of the weights is not so important for the composition of Xi 
if the weight c, of the random factor is small as if the weight is great 
In order to keep the weight of the random factor small enough, 
rather large and homogeneous areal units should be used It is especially 
important to avoid units with very few observations, as their ecological 
values will contain a considerable random element, even if the areas are 
homogeneous With regard to percentages lower limits should be 
maintained both of the denominator and of the expected value of the 
numerator under the hypothesis of no systematic variation between 
areas Moreover, if the mean size and the lower limit of the number of 
observations per area arc rather small, the variation in size should be 
r«tricted For other reasons this holds under less restneted conditions 
also as a s>stematically uneven sizc-distnbuiion of units may influence 
both size and sign of correlations 


Similarly, the limitation of random components naturally has other 
advantages than makmgweights uniform. Asmother research situations, 
random elements hamper generalizations m diminishing mvanances 
twiwcen samples, whether replicative or systematically vaned Of course, 
they also may wnccal possible “real ’ difTercnces, as systematic elements 
are correspondingly weakened 
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3 4 Modifiable Units 
As IS well known, the ccologteal units are 
units can be modified in 

to vary the population “"f compared core 

motivated from ecological theory, Helsinki (Swectser, 

and suburbs ,n the Boston the 

1964, and the paper by ''^;”t“roblem Hadden and 

choice of units to study is mote o jUjrMtively defined urban 

Borgatta investigated the SopoWan Areas, and 

units, employing Urbanized Atoas Stantod Met 

politically defined cities as units They rouna 

by no means total, ‘"''““"f the areal units evidently can be 

In analysis of urban ,„|i„ite number of nays as m 

delineated and combined in an Cerlainl), some sets of 

using blocks conveniently as t„h„s but only etccp- 

units are clearly and ^ ,|| others according to cntcna 

tionally IS one particular se tal or specific homogcncily 

selected as relevant, such m ° ‘ j,t„„e„seness To reduce the 

character of social subW*'™ “/^tny limited class among uhich 
class of most P^f'^tle sets to a sufflc ^ m an 

the choice may be '"“■'T"'"'/" S sue is partially lechmcal a the 
size of units The question “f „ith.n s'"' Tf 
number of available cases “^^t^crements on the numto o 

defines a lower limit on size whemas '^ ^^^^_ Ptir instance «nsus 

untts to study indtcate -Xlogtcal units but may be too 

the only practical constructed (he 
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Nevertheless, the c correlation ,tv of units and 

analysis The the degree of homogcnc y 
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332 


CARL-GUNNAR 3ANS0N 


will generally produce different matrices of correlations, but the eco- 
logical conclusions reached ought to be invariant under choice of sets 
of units if a “good ’ choice has been made Naturally, a bad choice of 
units can deprive the study of most of its interest The matrix with the 
highest level of correlations generally should be technically most efficient 
Thus, there is a question of invariance involved also in connection 
with the selection of areal units As one way of treating this problem, 
it is suggested that ecological factor analysis works with alternative 
sets of units in situations where selection of units occurs, as in studies 
of urban spatial structure The study of Swedish cities at the Swedish 
Building Research Institute uses two sets of units of different mean 
sizes for some cities Otherwise, such alternative sets are very seldom 
used 


4 OTHER TECHNICAL PROBLEMS 
4 1 Disinbutwns 

4 11 Dislrtbuiional Assumptions In the basic Equations 1, or in 
other equations necessary for the factor analysis, there is no assumption 
of a specific frequency distribution of factors or of vanables Thus 
normal factors or variables are not necessary 
Optional assumptions concerning distributions may, however, be 
useful First, they may add new permissible operations to the analysis 
For example, some tests of significance of weights and of number of 
factors have been derived where normality is assumed, although the tests 
may have some degree of robustness Second, additional meanings or 
more satisfactory interpretations may be given to operations or results 
under assumptions of mullmormal distributions For example, the 
pnncipal factors may be identified with the principal axes of ellipsoidal 
density surfaces when variables arc multmormally dfslnbuted When 
uncorrelated factors arc multinormal, they are also independent, as is 
well known, and that usually makes interpretations much easier Multi- 
normal factors also generate multinormal variables When there arc 
several independent but not normally distributed factors, the distribu- 
tion of the vanables will be somewhat closer to normality than those of 
the factors It should be noticed that the transformation of two variables 
separately to onc-dimcnsional normal distributions does not neccssanly 
produce a bivanate normal distribution The regression of one trans- 
formed vanable with regard to the other may even be curvilinear 
4 12 Re^ressional Assumptions The possibility of curvilinear re- 
gressions. even if the marginal distributions arc normal, certainly is a 
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Whatever formal reasons there may be for norma! variables and 
however fortunate ecologists would be to have their variables distributed 
that way, the ecological results should be invariant under transformation 
of variables unless there arc theoretical reasons for preferring one or 
the other type of variables Without such reasons, the choice is opera- 
tional and can be made to increase efficiency and to facilitate the re- 
search work but must not otherwise influence the ecological conclusions 
On the other hand, if the transformation makes no difference at all, 
perhaps the original variables can be interpreted in the same way as the 
transformed ones 


To study the impact of normalization in relation to the study of 
Swedish communes, three sets of variables were analyzed both before 
and after normalization of marginal distributions m a pilot study of 
51 communes (Janson, 1965) If incompatible results were obtained, 
they could be treated m either of two ways Either they might both be 
dropped, or one version might be kept, if it appeared to be based on a 
more reasonable version of measures Convincing reasons why one 
type of variables should be preferred to the other could be put forward 
in but one case 

The distributions of “Population** and two closely related variables 
were originally very skew They held a concentration of units with small 
values and even smaller differences between values When normalizing, 
these insignificant differences, often random m both size and direction, 
were given great importance Here it seemed more reasonable to 
use a measure of size which allocated approximately the same values 
for the smallest units of 2,000-3.000 population than to employ a 
measure that made differences of some ten persons in the smallest 
communes as important as differences m population between the big 
cities say from 250 000 to 25,000 If different results concerning size 
should appear, it was decided in advance to favor the results from the 
original, nontransformed series 


""“'"’y parallel with the exception of a 
bizc lactor appearing in the original sets but not in the normalized 
clusters of population variables were dissolved and 
™"ables of an “Urbanization” factor It was then 
’’ "°™al,zation in the mam study Instead, other 
skewness employed there to reduce curvilineanty and 
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As >et there seems to be no generally accepted and generally applic- 
able way of deciding when to stop factoring, that is, how many dimen- 
sions to extract In general, there are more or less obvious upper and 
lower limits, but in between there is often some room for discretion, if 
not, the strictly analytical formulas are thought to apply Besides, even 
if the number of significant dimensions has been formally estimated, 
the last few dimensions may be too weak to be ecologically interesting 
or even interpreted Furthermore, different levels of significance may be 
selected 

In situations where the number of dimensions cannot be decided 
without ambiguity, it might be interesting to know what degree of 
invariance the rotations according to a given principle have when the 
number of rotated dimensions vanes If p principal axes have been 
extracted, they will not be changed by the extraction of the (p-i- l)th 
axis But if p principal axes are rotated to vanmax dimensions, these 
Will hardly be completely stable if instead p + I axes are rotated 
Again, there is a question of invariance Acceptable modifications would 
be the split of one dimension in two and perhaps the regrouping of 
two dimensions into three, but more thorough alterations would make 
the rotation dubious 

In the study of Swedish cities vanmax rotations of 6 and 8 principal 
factors are compared In two analyses of detailed urban plans (Fog, 
Janson, and LanesjS), 6, 7, 8, and 9 pnncipal axes were extracted and 
rotated to vanmax dimensions On the whole, the changes were small, 
and the 6 original dimensions were rather intact also with 9 dimensions 
For technical reasons, the 6 of them together accounted for somewhat 
less variance, but also this changewas rather small Obviously, however, 
the pnncijjal factor solution here has an advantage in its complete 
invariance under addition of new dimensions (cf Rnhinen) 

4 2 3 Communaliiy Esiimafes The need for inserting communality 
Climates m the mam diagonal of the matrix to start factoring plagued 
factor analysts for a long lime With the advent of the modem computer 
equipment matters improved considerably in this respect Now the 
^estion o communality estimates seems quite unimportant in most 


First, cfTiacnt estimates from assumptions of a given number of 
?u f^ultiplc correlations are easily available Second, 
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of such technical invariances, which must be satisfied for the method 
to be useful m revealing substantive invariances, can be stated for all 
methods of analysis, although some methods may offer more alternatives 
of procedures than others 

For factor analysis the invariances concern primarily the following 
a The number and selection of theoretical variables 
b The operational definitions of variables 
c The definitions of area under study 
d The sets of areal units 

e Certain computational and rotational techniques and number of 
dimensions 

Not many ecological studies of such invariances have been reported 
Thus far, they point at a considerable invariance and tend to support 
the claims of descriptive and classificational usefulness of factor analysis, 
provided the method is used with an open mind for its limitations 
Thus the not very original conclusion is that more studies arc needed 
but that the evidence presented for the time being is mostly, but not 
totally, favorable to factor analysis 
Finally, it is suggested here that factor analysis, when used to define 
basic descriptive dimensions, could conveniently be carried one step 
further than to the dimensions It should more often be followed by 
computation of factor scores, that is, the positions of the units on the 
dimensions, for further analysis by other methods 
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Voting in the London Suburbs; 

A Factor Analysis and a Causal Model' 

Tke dichotomy of a poljijcally rjgbi-wing suburb and a left-wing 
central city has been a pervasiw theme in recent writings on the electoral 
geography ofthe United Slates * Much ofihe interest in suburban voting 
patterns accompanied the rapid suburbanization of population which 
took place after 194^, thus, the phenomenon was hrst seriously noticed 
after the 1952 election when 51 of the 67 counties included by the 
Census Bureau in its classification of the twenty largest metropolitan 
regions m the United States, went Republican ’ In England, this feature 
of electoral geography has attracted less attention, but the Conservative 
disposition of such suburban constituencies as Solihull, Tynemouth, 
and Knutsford when compared w-nh the support for the Labour Party 


‘ Revised version of an anrcte fi«i primed m the M/mafj of ihe Assotmsion of 
American Geographers pp JII-127 Pnmed here with the permission of 

Ihe Associalion of American Geographers The onjinal lilte was Suburbia and 

Voiing Behavior in Ihe London Meiropoh^ Area 

An earlier version of this paper was awarded mt prize in ihe Wmois Ceographica I 
Essay Awards compeliiion t j l . r , - ,k .. 

* For examples of recent wniings on the dichotomy of a pofiticafly right wing 
suburb and a left wing «n'n»ioW.s« fs JHere a /^epiMcan Afojonry » 

/NfwViirk Haroer J945J. W Whyie, TV Gis^ww^rwr A/®t (New York Simon 
and Schuster 1956)' B Lazefwitt Suburban VoJwe Trends I94S~1956 ' Soc/a/ 
forca JP I (Oclo^r 1960} p 29. and F Greenste.n and R SVoIfinger. “The 
SubSis and sSmg Party Loyah.es," Opmmn !?«arlerfy ZT (VVmfer I95S- 

*^*G* Janwfk, “ITte Subortiia Psolitieal Significance," Carrenr History 
(August 1956),pp’9t-95 
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in their respective central cities of Birmingham, Newcastle, and Man- 
chester suggests that the dichotomy is not confined to the United States 
Certam other evidence suggests that political behavioral difTerences 
between central aty and suburbs may not be confined to party pref- 
erence Thus, recent work by Campbell ei al suggests for the United 
States that there are differences in the level of turnout between central 
city and suburb, the central city showing markedly lower levels of 
turnout than the suburbs * 

This paper has four purposes 

1 To test the hypothesis that the suburban environment in the 
London metropolitan area is associated with party preference in a 
Conservative direction 

2 To test the hypothesis that the suburban environment in the 
London metropolitan area is associated with voter turnout in the 
direction of greater suburban turnout, and dependent upon verification 
of these hypotheses 

3 To specify the significance of the suburban context for these two 
aspects of voting behavior when social contexts other than the suburban- 
central city dimension are taken into account 

4 To draw out the theoretical implications of the results for the 
study of the geography of intraurban voting behavior 

Beyond the immediate conclusions regarding suburb-voting be 
havior relationships in the London area it is hoped that the study will 
indicate techniques which might be used in the study of these same 
relationships in other parts of England for comparative purposes The 
study universe comprises the Parliamentary constituencies wholly or 
partly within the London metropolitan area, the only definition of the 
London metropolitan area is that made by the Metropolitan Police, 
and this is approximately comadent with the area outlined by Freeman* 
as the London conurbation in 1955 (Figure 1 and Table 1) 


percentage vole going to the Conservative Party in the election of 
1950 in these three cenlral ciiy-suburb pairs are illusiraiive 
Central Oiy Suburb 

Birmingham 42.7 Solihull 63 3 

Neivcastle 42 3 Tynemouth 49 30 

Manchester 42.9 Knuisford 49 2 

A CimpWl P Z Converse W E- Miller and D E Stokes. The American 
li^erfNew York Wiley 1960) p 464 

T W Freeman The Conurbauons of Great Britain {Winchciier Manchester 
University Press 1959) p 66 
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Paruamlntarv CoNsrrroENCi^ of • 
London Mi-tropoutan Area 


• 1 Battersea 

2 Bermondsey 

3 Bethnal Green 

• 4 CambcfNvcll 

• 5 Chelsea and Kensington 

• 6 City of London and 

City of Westminster 
7 Deptford 

• 8 Fulham 

9 Greenwich 

• 10 Hackney and Stoke 

Newington 
•11 Hammersmith 

.1? sr.'i.s, 

• 14 Islington 

• 15 Lambeth 

• 16 Lewisham 

• 17 Paddington 
18 Poplar 

21 Southwark 

22 Stepney 

• 23 Wandswortn 
24 Woolwich 

• 27 Eal.nEi>"<‘S»“""‘" 

28 Edmonton 

• 29 Enfield 

30 Finchley 

*’4 Srandl*wcrth 


35 Hornsey 

36 Ruislip Nofthwood 

• S island wood G,ee„ 
39 Twickenham 

• 40 Wembley 

• 41 Willesdcn 

42 Spcithorne 

43 Uxbridge 
*44 Croydon 

45 Kingston 

46 Merton and Morden 

47 Mitcham 

48 

49 Sutton and Cneam 

50 Wimbledon 

51 Carshalton 

52 Epsom 

53 East Surrey 

54 Esher 

55 Barking 

56 Dagenham 

• 57 East Ham 

• 58 Ilford 
59 Leyton 

(fi Walthamstow 

• 61 West Ham 

52 Woodford 

63 Epping 

64 Beckenham 

65 Bexley 
55 Bromley 

69 Barnet 


» — r jwo or more cou 

rSco-"“'"r 



346 


KEVIN R COX 



0 2 4 

MILES 


Figure 1 

Location of CosynruESCTEs 
ftoie See Table 1 for the key to the names of constituences 

The voling data to be analyzed arc for the General Elections of 1950 
and 1951, and the dependent variables will be the proportion of the 
total vole going to the Conservative Party at a given election and the 
proportion of the electorate voting in a given election The distribution 
of the percentage of the total vote going to the Conservative Party m 
I IS presented in Figure 2, the distribution of the percentage of the 
e ectorate participating m that General Election is presented in Figure 3 
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cr » 4 34?4 


Figure 3 

Percentage of the Ecectorate Participating in the 
General Election of 1951 

basis of this, one can identify certain problems of research design re- 
quiring rectification and can tentatively define suburbanism. Second, a 
Wide array of socioeconomic variables will be chosen and reduced by 
means of principal-axes factor analysis to a much lower number of 
dimensions descriptive of the social contexts within which the popula- 
tion of the area votes Third, the hjpotheses will be tested against a 
suburban-central city dimension by means of simple correlation. 
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of the individual voter and (by tacit assumption) contextual character 
istics that show no patterned variation between central city and suburb 
Berger for example in a study of a new industrial suburb to the south 
of San Francisco found that central city to suburbs residence change 
had no effect on the proportion voting Republican His conclusions 
have been supported by a study of a residential suburb of Kalamazoo, 
Michigan “ 

The second theory states that the suburban context exercises a con 
version effect converting erstwhile Democrats or Labor Party sup 
porters from the central city into Republicans or Conservatives when 
they move to the suburbs Greenstein and Wolfinger, for example, 
have examined public opinion poll data on individual social character 
istics cross classified with a suburb-central city dichotomy, and they 
have found that even when political party identifications are controlled 
for education income occupation and ethnic group the suburban 
Republican tendency is not erased 

The third theory states that the right wing vote of the suburbs is a 
result not of conversion but of selective migration of Republicans as 
opposed to Democrats from the central city This is a theory which has 
received some confirmation from the work of Campbell 
Examination of the research referred to previously, and of other 
literature on the problem of suburban voting behavior,’* suggests 
certain characteristics of previous research design which can be im 
proved m any further testing of hypotheses related to suburban effects 
on political behavior The problems that need attention are as follows 
(1) Suburbamsm has not been defined sociologically or economically. 
It has been defined as a political unit outside the boundaries of a central 
city and the specific socioeconomic characteristics of suburbamsm 
have not been ngorouslj isolated (2) Suburbamsm has been treated as 
one part of a suburban-central city dichotomy In this study, the 
assumption will be made that suburbamsm lies at one end of a con 
tinuum at the other end of which he characteristics exclusive to the 


!9«f Berger Horkwg CIosm Suburb {Bcike\cy University of Cal fomia Press 

nlV “Suburban Residence and Pohlical Behavior Public 

Opinion Quarlerl) 22 (Winter J958 1959) pp 483-489 

and R Wolfinger The Suburbs and Shifting Party Loyallics 

r Op 22 (Winter 1958 1959) pp 473^82. 

Campbell rr fl/ op at fooinoieS pp 453-467 

*be rclat onsh p between suburban residence and voting 
S M '“>"'<>"2 Whyle Of oft roolno.o2 

rp ^ Metropoln American Poliiieal Science Review 52 (1958) 
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central city. (3) Most of the previous studies have failed to discriminate 
between the suburb and other socioeconomic contexts that may be 
common to both suburb and central city, BosXoff has spelled our 
objections:** 

"^e lack of definite and meaningful patterns in political affiliation following 
the suburban move is perhaps best interpreted as a consequence of ex- 
ploratory investigation and inadequate conceptualization, rather than a re- 
flection of "reality” Furthermore, invesngators often proceed on the 
notion that suburbs are demoeraphicaJIy and organizationally inter- 
changcaWc units wherever they are found The only exception is a dis 
tmction between industrial and residential suburbs 

Similar critical comments have been made by Campbell ** 

In this study, suburbanism will be defined in the locational, economic, 
demographic, and social terms in which previous writers have defined it. 
Economically, the suburb is dependent upon a central city for its eco- 
nomic base. DemographtcalJy it is characterized by a relatiielylow 
density of population but a rapidly increasing population owing to the 
addition of migrants from the central city Socially, the suburb is 
predominantly middle class, younger rather than older, with high 
percentages of home ownership and married couples, but with low 
occupational sex ratios *’ 

The variables from which contextual dimensions have been derived 
are listed in Table 2 The data for all the variables but one have been 
derived from the (951 Census of England and Wales The final variable 
— distance from the central business district — is the mam road dis- 
tance m miles from the center of the City of London and City of 
Westminster constituencies combined to the geographical centers of all 
the other areal units employed The variables have been selected on the 
basis of three criteria first, Ihcir relevance m terms of the theory of 
voting behaiior,*® second their relevance for the definition of subur- 
bamsm. and third, the availability of data Six variables required 


'*A BoskolT, •‘Suburbs. Voting and Style of Life Cues to Patterns in Voting 
Shifu” (unpublished manuscript, Depanmeot of Soaotogy. Emoty University), p 4 
"> Campbell ef o/ op cit footnote 5. p 454 
The criteria employed draw heavily upon definitions suggested m the following 
works O D Duncan and A J Reiss, Soeul Omracltrisiics of Urb^n and Jturaf 
Communuiei /9i0 (New York Wiley. 1956). S F Fava * Suburbanism as a Way 
o^\,de^' American Socioiogicat Re\»tin Z/(1956) p 35 . „ 

'•For an .ntroduclion to the relationships bctw«n vonng behavior and 
economic variables, see S M Lipset Arfifico) A/®i (New York Doublcday. I960) 
^ Berelson.P F LazarsTeld, and W McPhec. lo/in? (Chicago University of 
Chicago Press, 1954) 
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logarithmic transformations m order to preserve linearity of association, 
and these are astensked m Table 2 
In order to derive contextual dimensions from these variables, mter- 
correlations were computed, and a principal axes factor analysis was 
undertaken The factors with eigenvalues greater than unity were then 
rotated to a normal varimax position in order to facilitate empirical 
interpretation of the factors Factor analysis is a highly appropriate 
tool in that it aims at accounting for the variances of the variables in 
terms of a limited number of factors or indices of the geographical 
structure Each variable has a loading or weight on each factor, and this 
loading IS in the form of a correlation coefficient measuring the degree 
of association between the factor and the variable The higher the 
loading for a variable the higher the degree of association between the 
respective factor and the variable ” Given the theoretical relevance of 
the variables incorporated into the compulations, a rotated principal- 
axes factor analysis allows one to specify in four continua or dimensions 
those aspects of the social geography of the London metropolitan area 
which are relevant to the political behavior of the population of the area 
and to the definition of suburbamsm For each factor, each areal unit 
has a factor score The factor loadings with values greater than 0 3 
are shown m Table 3 


The first factor accounts for just less than 36 5 per cent of the 
variance In terms of the loadings of the variables on this factor, it may 
e regarded as a social rank dimension The two variables loading 
highest on this factor are the social class variables Other indicators of 
social rank such as years of school education persons per room, per- 
centage o occupied males employed in manufacturing and percentage 
01 occupied males employed in white collar occupations also load quite 
high and in the expected directions on this factor The loading of 
flistance f^rom the central business district of the London metropolitan 
^ ^ ® icator of the locational attributes of suburbamsm — is 

end nfTh ” of ideal polar lyp^ the factor describes areas at one 

manlltn employed in 

crowdmp '"i educational attainment is low over 

the miHHi Pi'oportions of the population are found in 

At the proportions are found m the lower class 

which eHii 1 ° ^ ^ are typically middle class areas in 

which educational attainments are generally high few are employed m 

H H HarnSn a’tcs factor analysis can be found in 

PP 154-191 (Chicago Universuy of Chicago Press 1960) 
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Table 2 

The Vaiur^lcs 


Variable name Amplified Description of Variable 


• POPCHA 

• POPPEN 
15-44 

OY65 

MARR M 
MARK F 
1 It 

•IV V 

EDUC 

•PERS P RM 
HOS SHA 

• OCC SR 
%SERV 

•/,MANUF 

%WC 

• %TRPT 
COMES 


%COM OUT 
%COM LDN 
CBD Disr 


Population change by migration, 1931-1951 
Populaiion density 

Percentage of population aged between IS and 
44 

Percentage of population aged 6S and over 
Sei latto 

Percentage of males married 
Percentage of females married 
Percentage of occupied males aged 15 and 
over m Social Class I and lit 
Percentage of occupied males aged 15 and over 
in Social Class IV and Vt 
Percentage of occupied males aged 1 5 and over 
who left school before the age of 15 
Persons per room 

Percentage of households sharing a house 
Occupational se« ralio 

Percentage of occupied males encaged in 
service mdusines 

Percentage of occupied males engaged in 
manufacturing industries 
Percentage of occupied males engaged in 
white collar occupations 
Ptrcentage o/ occupied males encaged ir? 
iranspotiation 

Net outflow of commuters as a percentage ot 
labor force commuting outside constituency 
of residence 

Perceniace of occupied labor force commuting 
outside constituency of residence 
Pcrccniacc of occupied labor force commufmc 
to the cities of London and Westmmsicr 
UisiatKc from the center of the consiitucncy 
to central business district m miles 


tTh«c vanabl., 

Englandand 1951 ^ ^ coii 5 .«tsc.fihehfshot raiHed oecuranons 

SifUfee Compiled by author 
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Tables 

Selected Factor Loadings 



I 

II 

III 

IV 

POPCHA 

0 4466 

-07657 



POPDEN 


0 9068 



15-44 

-0 3223 

04535 


0 7541 

OV65 

0 4840 

04718 


-0 6446 

SR 

0 8669 

03814 



MARR M 


-0 6735 

-0 5368 

-0 3552 

MARR F 

-0 5524 

-0 7W5 

-0 3546 


I II 

0 9517 




JV V 

-0 8214 

03899 



EDUC 

-0 9627 




PERS P RM 

-0 5875 

04420 

0 3717 

0 4415 

HOS SHA 


0 8711 



OCC SR 


0 8782 



%SERV 

0 8507 


0 3746 


%MANUF 

-0 8721 


-0 3582 


%wc 

0 9247 




%TRPT 

-07592 

05178 



COMEX 



0 9185 


%COM OUT 


0 4333 

—07122 


%COM LDN 





CDD DIST 


-09360 



VP* 

36 48 

30 50 

14 57 

7 65 


iXw* comnbuiion of a factor to the variances of all the 


Source Cakulated by author 


'^hitc-coUar occupations — and in 
v.hich there is very little overcrowding of homes 

second factor accounts for 30 5 per cent of the vanance In terms 
of the loadings of different variables on this factor it may be regarded 
Lm '^'‘y ‘•■■nension The loading of distance from the 

factor se„“,',^"”' ’’'S'' ^“gg^ting high 

area L ^ this dimension on the periphery of the metropolitan 

of Lnubirn h "" "'"topol'tan area The loadings 

«re d,r«,orn?r^l Population density are also high and are m 
ofthc suburb^ I "'’‘'“"t regarding the characlerislics 

the proportion of women i„ the labor force the proportions of mates 
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and females married, and the proportion of houses shared; this latter 
variable IS a surrogate for home ownership The ideahtype suburb 
derived from this analysts is an area located at a considerable distance 
from the central business district, one with relatively low population 
densities but high population increase, high proportions of males and 
females married, a small proportion of women at work, and a low 
proportion of houses shared 

The third factor accounts for just less than 14 6 per cent of the 
variance of the variables The three variables indicative of the magni- 
tudes of commuting to work, out of given areal units load highest on 
this factor This factor may be regarded as a descriptor of a commuting 
dimension 

Finally, the fourth factor accounts for just over 7.6 per cent of the 
variance Its two highest loadings are descnptive of two aspects of age 
structure the proportion of the population in early adulthood has a 
comparatively high positive loading on this factor, whereas the pro- 
portion in the retirement age groups also loads fairly highly but in a 
negative direction 

CORRELATION DATA AND A CAUSAL MODEL 
The compulation of correlation coefficients allows one to assess the 
degree to which the proportion of the electorate voting Conservative 
and the prcporiioa setuaiiy turning <?ut to » ots art related, first, to the 
scores for the suburban-central city dimension and. second, to the scores 
for other dimensions obtained in the factor analysis The results arc 
shown in Table 4 Without taking info account relationships with other 
variables, it can be determined from the table that the suburban-central 
city dimension accounts statistically for 18 5 per cent and 21 2 per cent 
of the variation of the Conservative vote, and 49 per cent and 54 8 per 
cent of the voter turnout for the 1950 and 1951 elections respcciiveK 
In order to evaluate the degree to which the suburban-central city 
dimension is associated with these two aspects of voting behavior 
when other social contexts arc taken into account, one could compute 
partial coefficients of correlation Computation of partial coefficients 
of correlation, however, docs not allow one to assume asymmctf> of 
relationship It would be preferable — if it vvere possible — not on)> 
to evaluate the significance of the suburban-central city dimension m 
correlational terms but also to do so in causal terms This latter con- 
sideration leads to consideration of the work done by sociologists in 
inferring causal models from correlational data 
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The language of theory and that of empirical research, it must be 
noted, are quite cJifTcrcnt and alien one from another Thus, causal laws 
assume control of all relevant xanablcs. but such control can never be 
tested empirically Likewise, causality involves the notion of production 
of an cflcct or asymmetry of relationship, such asymmetrj is difficult 
to formalize in mathematical or logical systems 
Political scientists, sociologists, and economists have attempted to 
cvolse methods of obviating such difficulties of relating empirical 
research to theory , BlatocL, in particular, has done a great deal of work 
along these lines He has shown that it is possible to think in terms of 
causal models of reality that can contain simplifications such as the 
assumption that neglected factors left outside the system of inter- 
relationships operate m specified ways Such models refer to empirical 
generalizations and not to reality itself however, a number of alterna- 
tive models may yield the same predictions, and one can ne%cr actually 
establish a given model But one can proceed by eliminating or modify- 
ing inadequate models that gne predictions inconsistent with the data — 
the models of causal mechanisms that could not create the particular 
results obtained in the form of correlations Blalock’s technique relies 
heavily on the use of partial correlation coefficients In a given causal 
model, the partial correlation coefficients for variables that are un- 
related in the causal model are given predicted values of zero, such 
predicted values can then be compared with actual partial correlations 
in order to establish the adequacy ofdilTerent causal links Moreover, 
wheneier r -,2 3 « 0, * r,® Thus, the causal models to be 

tested have employed predicted correlations derived from the product 
of correlation coefficients For certain other variables that are not 
causally related via a third variable, the predicted simple correlation 
will be zero 

There are a number of causal models that could plausibly be postu- 
lated to relate Conservatism to suburbanism m particular, and to social 
contexts in general, there are also a number of models that could be 
suggested to relate voter turnout to social context However, the voter 
turnout and the proportion of the voters voting Conservative m an area 
arc so associated (r* = 10 9 per cent m 1951 and 27 per cent in 1950) 
that It seems potentially more fruitful to postulate a senes of more 


For a synihKis and exposition of these ideas see H M ^Wock Cmwa/ /«- 

firem-ts Nowx/}ermfnial Research (Chapel Hill University of North Carolma 

A^Simon. Social aad Raimnal (Hew YarV. Wley, I95Tj 

Chapter 2 
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integrated causal models Two of the theoretically plausible possibihtiK 
are set out in Table 5 together with predicted correlations and observed 
correlations 


Model J 

In the first model (sec Table 5) suburbamsm has both direct and in- 
direct causal relationships with both party preference and participation 
Suburbamsm acts indirectly on parly preference b> affecting social class 
location. It also acts indirectly on participation by affecting commuter 
location and hence identification with the community Age structure 
can plausibly be regarded as affecting party preference directly — there 
is a considerable literature on the relationships between old age and 
political conservatism’^ — and indirectly by affecting soaal class 
location increasing age does tend to be accompanied b> upward soaal 
mobility m the class structure and accompanying residential change 
Some of the predicted relationships are quite accurate in this model, 
the predicted relationship of 34 between participation and party pre 
fcrence is precisely as expected Other predicted relationships, however, 
are wildly inaccurate 


Model JI 

When faced with such an inadequate causal model as this, Blalock 
has suggested two rules of thumb for modifying the causal model (I) 
make changes where there are the largest discrepancies between actual 
and predicted values, (2) where possible, make changes first in causal 
relationships among variables presumed to be operating near the 
beginning of the causal sequence, that is those vanables that are taken 
to be the most independent of the others Both these rules of thumb are 
satisfied if the direct relationship between age structure and Con 
servalive vote in Model I is erased, and an additional indirect link be- 
tween suburbamsm and party preference is provided by age structure 
and soaal class structure m Model II (see Table 5) The discrepancy 
between predicted and observed for the relationship between age struc 
ture and suburbamsm — the two most independent vanables in Model 

** Commuting is included insofar as il increases or decreases community id'Titi 
fication and hence indirectly alTects participation it is theorized that the more time 
a person spends in his local community the stronger will be his identifkaiion with it 
There is ample evidence to suggest that ih» longer a person resides m a given com- 
munity the greater the 1 kelihood of hts participating in politics see Campbell 
el a! op cii footnote 5 p 464 

** See Lipset, op at footnote 18, pp 231 232. 
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1 _ IS one of the largest , if a causal arrow is drawn between these two 
variables, it also affects other large discrepancies such as that between 
participation and age structure The resultant causal model shows a 
closer fit betNveen predicted and observed correlations 

In terms of the relationships between suburbanism and other social 
contexts, on the one hand, and two aspects of the intraurban political 
geography of the London melropoblan area, on the other hand, this 
final causal model suggests the following conclusions 

1 Geographical variations of participation have sources dissimilar 
to the sources of geographical variation in party preference, 

2 The social rank and age structures of particular areas affect party 
preference but not participation, 

3 Variations m the magnitude of commuting affect turnout but not 
party preference, 

4 Suburban location affects directly both participation and party 
preference when other contexts arc taken into account, 

5 Suburban location is of strategic significance in that it is also 
related to the political behavioral variables by causal links of an indirect 
nature 


THEORETICAL IMPLICATIONS 

A causal model of suburban-central city political variations has now 
been elaborated If it fits at all, how docs it fit into broader theory con- 
cerning such political differentiation'^ The two broad streams of 
thought regarding suburban-central city differences will first be specified 
m detail, and I shall then try to relate them to the causal model It 
should be noted that almost all the work concerned with suburban- 
central city differentiation has been carried out by American workers 
withm the American context, and hence one must work by analogy 
between British and American conditions 

Concerning party preference first, there have been two explanations 
of suburban Republicanism in the United States the transformation or 
conversion theory and the transplantation theory The conversion theory 
postulates that the new suburbanite is an erstwhile central city Democrat 
who, upon taking up residence m the suburbs is subjected to relatively 


^TTiis d^r fit IS replicated for the 1950 data though the discrepancy betv^ecn 
prcdi^ed value and expected salue of correlation is rather higher for the participa 
tion-Conscrvaiivc vote relaiionshtp ( 30 smai SI) and for the partiapaiion-social 
rank relationship (^4 versui 44) 
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new stimuli that may lead him to change his political party idcntifica 
lion These stimuli mai beofcttotjpw first migration to the suburbs 
involves the novel assumption of property rights and its implications 
such as home ownership and the payment of municipal and property 
taxes (called rates m Britain) uhich favor identification with the poll 
tical Right if the individual can perceise the connection between his 
changed situation and party images Wood has expressed this line of 
thought neatly 

Suburbanites escaping from big city politics arc ready converts tothe 
small town set of political values They may have precious tiUlc equity m 
their houses but they think of themselves — and arc ihoughl of — as home 
owners 

A second novel stimulus affecting conversion may he the climafc of 
opinion The suburbs are located m areas that were once highly rural 
and dominated by the right wmg political parties Because Uieir 
organization is strong party workers can be icry effective m winning 
new adherents to the party among the suburbanites As Wood has 
written ** 

The Republican party lays claim to the suburban vole because it better 
protects their new interests and status and because it is the political faith 
of the old time residents whose friendship is cultivated by the newcomer 

Other workers who have favored the conversion theory have tried to 
identify the communication process bv which conversion lakes place 
Investigation of the pattern of social contacts in metropolitan ureas 
provides the conclusion that the suburb has a much higher degree of 
informal social contact or neighboring than the central city As San 
Duzer has put it 

The high percentage of married couples with young children the pre 
dominance of middle class status and the homogenety of the phyjial 
siting arrangements which are thoucht to be charaacnstic of suburbs arc 
precisely those which arc associated with a high rather than a low degree of 
neighboring 


** R C Wood Sut'urba /rs rnp/e tvid T^rrr /"i// fict (Boiion Uoi-jWen 
MifTlm 1953) p 137 

,» C ^An Aralys s of the D ffcwnce in Rcpubleanl^ Vniul 

V oic in aid and Their SuburS-buipuN shed PhD d «rUtK>n OeturtiKni of 

Geography State l/mverwtyof low* >962 p 17) 
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This explanation implies that because communication is faciiitated, 
attempted conversion to the dominant opinion of the area encounters 
little resistance 

Whereas the conversion theory postulates a change in voting behav lor 
after moving to the suburbs from the central city, the transplantation 
theory argues that changes in the voting behavior of suburban popula- 
tions observable since 1945 are owing to selective migration from the 
central city so that the suburbs receive migrants from the central city 
who already identify with the Republican Party The most compelling 
evidence in support of this theory comes from the work done by 
Campbell and his associates on the Presidential election of 1956 Their 
analysis points to the following conclusions (I) Fifty-three per cent of 
the one time metropolitan Republicans have moved out of the central 
city, whereas only 29 per cent of their Democratic Party counterparts 
have left, this dispanty holds true when holding financial ability to 
leave the central cit> and intergenerational mobility constant (2) 
Emigrants from the metropolitan center and m migrants to it are 
arranged in almost identical fashion on the party identification con 
imuum Members of both groups are dominantly Republican, whereas 
the lifelong inhabitants of the metropolitan centers constitute a per- 
sistent core of Democratic Party support The reasons for this, the 
authors speculate, he m the realm of social values 

In the absence of needed data, wc can only speculate about the meaning of 
these discrepancies in partisan response to intergenerational mobility 
It seems likely that fairly basic differences in social values are involved It 
may be that the Republicans, despite iheir own metropolitan origins are 
more often linked through endunng family tics to ancestral beginnings in 
small cities towns, and on farms Although at least one generation re- 
moved, they may reflect the consequences of an earlier era of urbanization 
and central aiy growth With a heritage of values and traditions congenial 
to nonmetropolitan life they may seize the opportunity created by oani 
pational success and escape from the metropolis The urban Democrat, on 
(he other hand, may be the child of a thoroughly urban culture Whether 
the metropolis of his family line once was Warsaw or Rome or Dublin, or 
Boston. Baltimore or New York, he may accept more often the way of life 
of the metropolitan center For him occupational success may provide the 
means to exploit and enjoy the treasures of the city, not freedom to leave 
the familiar for the unknown 

As far as central city-suburban differences in participation are con- 
cerned, little work has been done The only insights come from 

Campbell ei al op at footnote 5, p 467 
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Campbell a at who have cross tabulated for respondents the voter 
turnout in 1956 against thcpiacc where the respondent grew up and the 
pi ICC where the respondent nowlnes One time rural residents living m 
the metropolitan centers have the poorest turnout levels whereas those 
who grew up in the metropolitan center and still live there and those who 
grew up m a metropohttn center but have since moved out show' 
(he highest turnout levels CbmpbcH « at . comment ” 

The low turnout of formerfatmen rejects the agrarian political heniagc 
U 1 $ also consistent with (heir present relatively Joiv sense of pohiical 
eflicacy At the same time the migrant from metropolitan Jiving was sup- 
ported in his pjnicipatjon by a sense of cflicacy commensurate with hts 
objectively demonstraied ability to attain the higher goals among those 
valued b) our sooety 

The model elabonted tn this paper can be compared with previous 
Ihcor) in two respects (Dlheordertng of the dilTcrcnt contexts in Iheir 
relationship with political behavior, (2) the content of (he factors in 
terms of loadings The latter will be considered first 
The conversion theories tie in ostensibly with the content of the 
suburban-central c/ty fretor Percentage of houses shared can be taken 
as a surrogate for the extent of owner occupancy, and the second factor 
loads heavily on this variable Further, the content of this second 
ficior lends some support to the ‘•ease of communications in the 
suburbs" argument high proportionsof men and women m the suburbs, 
as opposed to the central city appear to be married, and the occupa- 
tional sex ntio IS low indicating a high proportion of women not 
going out to work and therefore having more time available for neigh- 
boring 

In causc-and elTcci terms, the implications of the causal model of 
suburban voting behavior m the London metropolitan area can be 
summarircd in the following manner 

1 Suburban location influences social rank location which influences 
voting for the Conservative Pirty. 

2 Suburban location influences age group location which influences 
socn! rank location which influences Conservative voting. 

3 Suburban location influences both preference for Conservative 
Party and political participation, hut its cflccts on these two facets of 
political behavior arc independent of one another, 


/WJ, footnote 5. p 464 
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4 Suburban location influences commuter location, which influences 
political participation 

It IS difficult to relate the form of this model to previous theory The 
model is especially weak in three areas (1) It is based on aggregate data 
concerning structural effects of social environment on individual 
political behavior, whereas most previous work on suburban voting 
behavior has been based on sample survey data for individuals (2) 
There are a great many intervening variables that have been omitted 
owing to lack of data, although one can see that suburbanism affects 
party preference independent of other contextual factors, one is still 
left in a quandary as to how this effect occurs, that is, is there a con- 
version or transplantation effect^ (3) Although couched in causal 
terms, the model fails to look back into the past for factors that might 
affect the dynamic interrelations of the contexts with political behavior 
today The only variable descriptive of change in the social structure 
incorporated into the factor analytic model is ‘ Population Change 
1931-1951 " A closer look at possible historical factors is illuminating, 
however 

In terms of their changes through time, the political and social 
geography of England are notoriously difficult to manipulate Thus, m 
the past fifty >ears the boundaries of the Parliamentary constituencies 
for which voting figures arc available have changed out of almost all 
recognition Moreover, the boundaries of these constituencies m earlier 
periods have frequently coincided only poorly with the boundaries of the 
geographical units for which social data arc available Nevertheless, it is 
possible to provide some evidence on historical changes in voting be- 
havior in the London metropolitan area in a nonrigorous research 
design, the answers from such a design as is envisaged, however, can 
only be suggcstiv e and not conclusive 
Three elections at approximately 25 year intervals — 1895 1922, and 
1950 — have been chosen and the percentage Conservative vote com 
puted for every constituency in the study universe in each election The 
London metropolitan area has been divided into four concentric belts, 
and the mean percentage Conservative vole for each belt for each 
election has been computed (Figure 4) Although the names of con- 
stituencies have in large part retained their identity over time, their 
boundaries have tended to fluctuate a great deal The concentric zonation 
cmp1o>s the constiluencies as they were in the election of 1950 and 
attempts to group them into zones on the basis of their centers of 
gravity Ctearl), the centers ofgravily for the constituencies in 1895 and 




Figun 4 

A COSCtNTBIC Z(A.^TI(>N Of THE LONDON MeTHOTOLITAN ARCA 


l922<}ifrerod somcHlnt bui the hmitation to onij four zones should 
ruminnze iruccuracies inherent w such an areal division The resultant 
statistics are labiilatesl and presented graphically in an attempt to 
demonstrate thedepree to uhfch and the manner in which thegradient 
of political opinion between the central city and the suburbs has 
changed over time (Table 6 and Figure 5> 

In statistical terms, the relevant results of these computations are as 
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Table 6 


Mean Conservative Vote for 
Concentric Zones» 1895, 1922, and 1950 



tg95 

1922 

1950 

ZONE 1 

621 

560 

42 0 

ZONE 2 

62 3 

56 6 

38 9 

ZONES 

76 9 

548 

449 

ZONE 4 

85 1 

67 9 

497 


Source Cateul&itd by author 


follows (1) The difference between the percentage Conservative vote in 
the central city and in the peripheral belts decreases over lime (2) 
Suburban zones exhibit a tendency over time to approach parity with 
the central city in terms of the percentage Conservative vote (3) This 
tendency toward parity extends itself spatially over lime outward from 
the central city 

In terms of the social geography of voting m the London metropolitan 
area, these results suggest the following First, the suburban areas have a 
heritage of high proportions voting Conservative, these proportions 
have diminished over time, probably indicating that decreasing numbers 
of immigrants from the central city have assumed a Conservative Party 
identification This is congruent with Lazerwilz’ recent findings for a 
period m the late I940’s and early 1950’s m American cities that, far 
from decreasing, the Democratic vole of recent migrants to the suburbs 
has actually increased Second, identification with Left wing political 
parties has spread outwards from the central city so that although 
remnants of the Conservative hentage sliH remain on the outer periphery, 
they have Jong since been erased nearer the central city 

The sources of this Conservative hentage are more difficult to identify 
To a certain extent, however, it docs seem to be independent of soaal 
Cornford examined certain simple associations for the 
1 885 election in the Metropolitan constituencies of London , he found ’ ‘ 


La7*n.,tz. “Suburban Voiing Trends 1948-1956 “ Social Forcer J9(t960). 

rS*' Oj^nizanon by the Bnt«h Omsenauve 



VOTING IN THE LONDON 5UBUSBS 

MEAN PERCENTAGE 
CONSEKVATIVE VOTE 
JOO *7 


90 



30* 

20 - 

10 - 

ZONE! ZOhteZ ZOME3 Z0NE4 
flgare} 

Change in the Relationship between Mean Percentage 
Conservative Vote and Four Concentric Zones 1395, 1922, and 1950 


1) That there was a high conviation (+074) between the Conservative 
percentage of the poll and the prosperity of the constituency, using rateable 
value per head of population as a incasure of prosperity, 

and 

2) That where two constituencies bad the same rateable value per head, the 
outer and less congested district tended to be more Conservative 

Less evidence is available on the precise nature of the Conservative 
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suburban vote in terms of social sources when social class is held con- 
stant Comford suggested, however, that^^ 

the outer suburbs, in the county divisions of Middlesex, Surrey, Kent and 
Essex were still the exclusive preserve of middle class commuters in 
1885 


CONCLUSIONS 

A firm conclusion that can be derived from this analysis is that sub- 
urbanism exercises effects on both party preference and participation 
independent of other social contexts Suburbanism also affects political 
behavior indirectly by influencing social class structure, age structure, 
commuter locations, but it also has eflects of its own 
The nature of these direct effects on party preference is more difficult 
to specify Aggregate statistics cannot tell one anything with a great deal 
of conclusiveness, bearing in mmd the constraints on inference implicit 
in aggregate data, the following statements may be made It seems that 
there is evidence for both the transplantation and conversion theories 
As far as the conversion hypothesis is concerned, the content of the 
suburban factor specifies both a stimulus (low proportion of homes 
shared as a surrogate for low proportion of homes rented) and a higher 
communication probability (high proportions of men and women are 
married, and fewer of the women go to work) The additional data on 
the dynamic component of party preference suggest that the conversion 
effect IS less likely to be political in character as compared with former 
periods of time The transplantation hypothesis, on the other hand, is 
supported by the suburban-agc-social rank relationships It is not 
difficult to see a relationship between, on the one hand, suburban loca- 
tion affecting age structure location, affecting in its turn social class 
structure location and, on the other hand, relationships between life 
c>clc, social mobility, residential mobility, and party preference for 
cc^m central city elements in the manner inferred by Campbell el a! 

There seems no good reason why one should choose one theory over 
the other when taking into account all the available evidence The 
CampWl transplantation hypothesis is highly convincing, but it gams 
part of Its conviction from the fact that the argument is presented in 

aihcr.ortcnns,eilhcrthcrcspondcnthasaRcpublican identification ora 

cmocratic Party identification, no allowance is made for variations 
along the party idenlificalion continuum which are known to exist 


** /6W, fcx5inote 31, p 4J6 
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Hence, if llie probability for a central city Republican to migrate to (he 
suburbs is high, and the probability for a central city Democrat to lsa\e 
the central city is low, then one would (end to allocate a probability 
intermediate m magnitude to the weakly identified Democrat, these 
are precisely the persons who would be more likely to be converted in 
party preference on movingloasuburban location 
The problem of suburban-central ctlj differences in political be- 
havior has now been attacked from two methodological angles survey 
data and context, the next stage is to combine both approaches m an 
effort to resolve the problem of the sources of distinctively suburban 
political behavior, such analyses are now being initiated with altitudinal 
and social data for Columbus Ohio and with social role data for the 
London metropolitan area 
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Third Survey of London Life and Labor; 
A New Typology of London Districts' 


'Hie definition of subareas witbm a larger urban area may be justified, 
on practical grounds, as a necessary step toward effective administration 
at the local level Indeed, for town planners and local housing authori* 
ties, planning could hardly proceed without the framework of land-use 
zones, neighborhood units, environmental areas, and traffic zones which 
subdivide the administrative units of local government For the socio- 
logist, the identification of subareas within the city provides an illumin- 
ating and fruitful approach to thestudy of city life Whether the aim of 
the investigator is the validation of generalizations m urban sociology 
or the development of planning strategy for the allocation of scarce 
resources, the definition of urban subareas is an important analytical 
step. The purpose of this bnef paper is to outline one method of defining 
subareas, m this case m Greater London,* by using a new form of dala 
and a new procedure of classification 
In the United States the availability of data for census tracts covering 
a period of twenty years (or longer m some cities) has generated con- 
TtstraitVi w. wbdwsww. -aC vw 

which census tracts arc used as the basic units Various methods of 

• The research proicel reported m thu paper was conduciwJ ai Ibe Centre for 
Urban Studies, Univenity College London, by Wolf Scott. Nigel ffowanj. and Peter 

Norman under the direction of Ruth Glass . . „ , . 

» • Gieaier London ’* in Ihis paper, means the area co>-ered by the Greater London 
Council for admimstrali'c purposes * latter London*’ « the pan of this area which 
was previously administered by the London County Cbunal, ’Outer London 
means the remainder of the Greater Uwwion Oouneil area 

37J 
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social area analjsis, cluster analysis scale analysis, and, recently, 
factor analysis have been devised and tested for census tract data 
Most methods seek to establish the dimensions in which census tracts 
can meaningfully be differentiated and to allocate them to a number of 
groups on the basis of scores for each of the important (usually complex) 
variables 

In marked contrast to the detailed material available m the United 
States, would be analysts of urban structure m the United Kingdom 
have been starved of adequate information Only very limited data have 
been produced for areas smaller than the basic administrative units of 
the country For censuses prior to that of 1961, data supplied for 
electoral wards and civil parishes covered only the very basic items of 
area enumerated population, and numbers of households and dwellings 
A few studies have made use of more detailed census statistics secured 
by special arrangement with the Census Office,^ but, on the whole, 
research has depended on published data for complete administrative 
units ranging m size from the one million population of Birmingham 
County Borough to a population of less than 2,000 in some small Urban 
Districts * 

As m so many fields of research, the situation has been radically 
altered by the advent of the electronic computer The use of a computer 
in processing the results of the 1961 census enabled the Census Office 
to produce a wide range of statistics for the smallest areal unit of the 
census operation — the enumeration district An enumeration district 
IS the area covered by a single enumerator during the collection of 


* Some important references are Robert C Tryon Idenllficaiion of Social Areas 
by Cluster Analysts {Bci\.e\cy University of California Press 1955) EshrefShevky 
and Wendell Bell Social Area Analysts Stanford Sociological Series No 1 (Stanford 
Stanford University Press 1955) Oitvin Schmid Generalisations Concerning the 
Ecology of the American City American Sociological Ee\ieH 15 (April 1950) 
C Schmid and K Tagashira Ecological and Demographic Indices A Method 
ological Analysis Demography / 1 (1964) A brief review of these methods and 
further references can be found in M D Van Arsdol F Camillcn C Schmid 
and McCannell Methods of Differentiating Urban Social and Demographic Areas 
papers presented at the Census Traa ConferetK* December 29 1958 US Bureau 
of the Census Working Paper No 7 (Washington DC US Department of 
Commerce 1959) 

b-lfe and Labour of the People in London (London Macmillan 
|9(H) tl Llewellyn Smith The Aen Survey of London Life and Labour (London 
P S King, 1930) Ruth Glass The Social Background of a Plan a Study of Middles 
brough {London Routledge 1948) 

adminmrative areas of England and Wales for which census information was 
publ shM in 1961 are 62 administrative counties 83 county boroughs, 29 mciro- 
poliian boroughs 317 municipal boroughs, 564 urban districts 474 rural districts 
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census information, data covcnng most ofthepopubtionand household 
characteristics investigated by census methods were made available for 
each of approximatel> 60 000 such areas m England and Wales, where 
previously they had been available onl> for the 1,500 administrative 
areas of the country For the convcojcncc of users this materia! was 
made available on punched cards or magnetic tape as well as in printed 
form, therefore research involving large numbers of enumeration 
districts was practicable The project reported here involved processing 
data at one snge or another, for approximately 11,000 enumeration 
districts and 845 electoral wards in Greater London The project used 
these data to provide a detailed factual description of the internal variety 
of London’s demographic, housing cihnic and class characteristics and 
to arrive at a general typology of districts which would best define its 
subareas m terms of these characteristics Research of a similar nature 
IS taking place m other major cities and conurbations m the United 
Kingdom * 

Analysis of the data for London involved three mam phases 

1 The computation and individual analysts of a set of intrinsically 
interesting variables for enumeration districts and electoral wards 

2 Analysis of the interdependence of the variables by correlation 
methods 

3 Classification of enumeration districts and wards into a varying 
number of types and the further analysis of the typologies 

Each phase will be considered separately 


Ff//iS£ I COMPUTATION AND ANALYSIS OF 
INDIVIDUAL VARIABLES 

Before proceeding further, it is important to make quite clear the 
nature of the data An enumeration distnci is an area designed primarily 
for administrative convenience, to provide each enumerator with a 
clearly defined task and as far as possible, to ensure that the amount of 
work invorvedfor each Jb# 

country The variation m population size of enumeration districts is 
therefore slight although unusual districts do occur with particularly 
large or small populations In London the population of districts 
ranged from only 70 to over 1,000. with an average size of 660 persons 

* Similar research pmjcc.s ha., ''' 

West Midtands. Ntanchesier, Sunderland, and Hampshire. 
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and 240 households, approximately 90 per cent of enumeration districts 
lay within the range from 400 to 1 ,000 population 
The data for each district refer to the population of the district as a 
whole, and no distinction is possible between the characteristics of 
particular subsections Ideally, the analysis was to have covered the 
population in private households alone, excluding people enumerated 
m hospitals, hotels, and other institutions, but because of the limitation 
of aggregate statistics this was not feasible As an approximation to the 
ideal, all districts in which the ratio of persons to households was greater 
than ten to one were excluded from the analysis, the assumption being 
that the population m these districts lived mainly in nonprivate in- 
stitutions of one sort or another Fortunately, large institutions usually 
make up one complete district and are satisfactorily excluded m this 
way, but small hotels, children’s homes, nursing homes, and other in- 
stitutions included in an ordinary enumeration district cannot be 
separately excluded In Inner London, out of a population of 176,000 
living in institutions in 1961, 97,000 were excluded from the analysis 
by this criterion, the populations of 369 districts 
Some of the most important items of information, relating to socio- 
economic status, were collected for only one m ten of the enumerated 
population Variables derived from these data are affected by random 
sampling errors, and scores for particular enumeration districts must be 
treated with some caution However, the occurrence of scores differing 
by more than two standard errors from their true value could be ex- 
pected not more than one in twenty limes Hence, the over-all geo- 
graphical configuration of maps based on sample data, and the results 
of a classification that included sample data as part of the input, could 
be accepted with some confidence "^e data are also affected by an in- 
dividual enumerator’s reliability Quality is likely to vary from district 
to district according to the enumerator’s precise understanding of work- 
ing definitions and instructions But no assessment ofeach enumerator’s 
results IS possible from the edited material, and the data were therefore 
accepted as presented 

The set of 42 variables computed for each of the enumeration dis- 
tricts are listed m detail m Appendix B To some extent, the field of 
“important” or “relevant ’ variables has already been determined by the 
process of consultation which precedes the detailed choice of questions 
by the Census Office The variables computed for each district covered 
all fields of the raw data, each variable was assumed to be an in- 
trinsically interesting difierentiating dimension between districts, and 
each was given equal importance in the analysis in the first instance The 
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enumeration of the population levered the following 


1 fopublloti structure indices of age structure, marital condition 
sex ratios 

2 Household structure indices of household size and compostdon 

3 //oiising coffc/ifioiw indices of room density, sharing of dHclIings 

and household facilities 

4 Tenure oj accommodation proportions of households m owner 
occupation, council tenancies,* and pnvale furnished or unfarmshed 
tenancies 

5 Or/g/n proportions of residents born in each of seven separate 
areas or countries outside Great Britain 


Variables computed from a sample of one in ten of the population 
covered the following 

6 Mobility of the population proportions of immigrants into the 
area during the previous year 

^ Socioeconomic status indices of occupational status, educational 
level, commuting 


As standard procedure the unweighted means and variances of each 
variable were computed separately for the districts of Inner and Outer 
London and frequency distributions of each >anab!e were drawn up 
for each sector 

The distributions of individual variables and the gcographicil 
locations of districts with particularly high or low scores’ indicated the 
extreme complexity and diversity of London particularly of Inner 
London Despite their smalt size enumeration distnets are rarely homo- 
geneous in population or conditions There were in J96J no districts 
m which the population was made up entirely of any particular occu- 
pational category or immigrant group identified by our variables and 
few m which the population were all living under the same housing 
conditions Forexample one might expect that a relatively large number 
of districts containing only 2*13 households on the average would con- 
tain households from only one of the four tenure categories listed under 
Group 4 In fact, only !4 per cent of the districu of Greater London 
had over 90 per cent of households m any of the four tenure categories 
identified 


‘ “Council tenants are heuscMfd* oecupyin* accoinnxxlalion suppled by a local 
admimstratiN-c authority cr by a New Town Cc*rporafto« 

’An e^irwTK score Is defined in relalion to ihc frequency dstniwliofl of each 

ind.stduat variable and WJJJ wry as the range of «bc vanafcle wnes 
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Similarly, the immigrant composition of districts is best described in 
terms of a wide range of mixed areas rather than as a polarity of highly 
segregated, ethnically “pure” districts In Inner London alone, at one 
extreme there was only one district analy-zcd which, in 1961, had no 
people born outside Great Britain," at the other extreme the heaviest 
concentration of persons born anywhere outside Great Britain was only 
66 per cent The highest concentration of any immigrant group that we 
identified was 36 per cent of the population born in the West Indies 
Persons born m both parts of Ireland nowhere accounted for more than 
30 per cent of the population of an enumeration district 
In terms of occupational status, a simple dichotomy of occupations 
into manual and nonmanual showed one third of the districts of Inner 
London with at least 80 per cent of males in one category or the other 
But when a more meaningful subdivision was used, separating the pro- 
fessional and managerial from other nonmanual workers and skilled 
from unskilled manual workers, only 17 districts had 80 per cent or 
more of males in any category, only tl per cent of districts in Inner 
London had 60 per cent or more males m any of the four occupational 
categories ® This does not mean that there was no class segregation , the 
geographical clustering of districts with relatively large proportions in 
higher-status occupations shows that there was The important point is 
that, although o\er half of the cmplojers, managers, and professional 
people of Inner London lived m only 20 per cent of the enumeration 
districts, they very rarely lived m homogeneous enclaves 
An extensive senes of maps, locating districts with extreme scores on 
particular variables, added further detail to the picture of London's 
diversity The most stnkmg feature of the maps in general was the ex- 
ceptional character of the North-West quadrant of Inner London com- 
pared With the rest of Greater London Many of the maps showed 
clustering of extreme scores only in this sector Extreme concentrations 
of single adults and of furnished accommodation fell entirely in this 
quadrant, other characteristics, although occurring in extreme form 
elsewhere, showed their greatest geographical clustering within this 
sector Concentrations of children under five of overcrowded house- 
holds, and of Irish West Indian, and Asian immigrants also occurred 
here On the other hand, the higher occupational groups, extremes of 
both high and low average room density and ofhigh and low sex ratios 
were also concentrated here 


Britain** means anywhere except England and Wales Scotland 
the Channel Islands and the Isl- of Man 

* ^^four occupational categories are defined m terms of the Registrar-General s 
soaocconomic groups n Appendix B — variables 40, 41 A, 41B, and 43 
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Clustering of many l>pes of eitremt districts in the 
North'West quadrant of London, districts with extreme \afucs of any 
one index tended to have qvnle specific and nonoxcrlappmg locations 
within the sector Areas with the highest proportion of furnished 
accommodation occurred immediately adjacent to. but not in, major 
immigrant areas, single adults were concentrated in districts peripheral 
to the districts of highest occupational status Although within this part 
of London concentrations of some characteristics did coincide, they 
tended also to occur in isolation elsewhere, in particular, although 
relatuely large proportions of Irish coincided with concentrations of 
West Indians in North Kensington and with Cypriot arms in Camden 
Town, the Irish alone were concentrated in Hammersmith, West 
Indians in Bnxton, and the Cypoms in Islington 
In addition to the characteostic geographical distributions of many 
of the variables just described there was also a polarity of extremes 
between the districts at the center and those at the outer edge of the 
Greater London area In particular, the proportion of children aged 
under fifteen and the proportion of households that owned their 
accommodation were both extremely low m central districts and high 
in districts at the outer edge of the area 
The geographical distributions of the variables we considered are not 
easily summarized in terms of some simple dichotomy (for example 
between East End and West End) The contrast between suburb and 
center was only one part of the pattern, equally important was the con 
Irast, at the center between extreme characteristics of districts lying 
within a small geographical compass 

PfIMSE 2 INTERDEPENOE/VCE OF THE VARIABLES 

The second major phase of the anal^is investigated the inter- 
dependence of each of 38 major variables with every other A square 
matrix of product-moment eoeflicicnls of linear correlation was com- 
puted from the variable scores of all the 4 570 enumeration districts of 
Inner London excJiidmg variables 4 25 31 and 33 (See Appendix B ) 
The matrix was analyzed b> several methods each of w hich is reviewed 
briefly in this section 

The complex geographical disinbution of characteristics as well as 
the diversity of conditions outlined m the previous section, was re- 
flected in the small size of most of the coeffiaenu m the matrix As (he 
correlation coefficient is an aienr^^e measure of association areas in 
which variables vary together are counterbalanced by areas in which 
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they vary independently of each other The diverse combinations of 
characteristics in Inner London therefore exerted an important influence 
on the values of the matrix, for example, one might, a prion, expect a 
straightforward relationship between occupational status and housing 
conditions — between the proportion of males in manual occupations 
and the proportion of households living in shared dwellings However, 
the relationship is not a simple one In council housing, where the 
occupational status of tenants is usually lower than average, there is 
very little sharing of dwellings At the other extreme, sharing of dwellings 
IS quite common in central areas of high occupational status where it is 
part of the cost of a central location In consequence, the correlation 
between index 18, the percentage of households sharing a dwelling, and 
index 41, the proportion of males m manual occupations, was only 129 
The correlation between index 1 8 and index 40, the proportion of males 
m professional and managerial occupations, was only — 141 
This type of complexity resulted m few high correlations among the 
variables apart from those of a technical nature Variables that were 
closely related by definition, particularly indices of sharing of dwellings 
and facilities and of room density and overcrowding, had high, but 
trivial, coefficients of correlation among tbemselves Apart from 
correlations of this sort, only two were particularly high, index 2, the 
proportion of children aged under fifteen, had a correlation of — 800 
with index 12, the proportion of households consisting of one or two 
persons, index 38, the proportion of early school leavers, had a cor- 
relation of 827 with index 41, the proportion of males in manual 
occupations, but even these correlations have an obvious logical ex- 
planation All other nontechnical correlations were smaller than ± 800 
Simple inspection of the coefficients showed that there were several 
important groupings of variables with relatively high intercorrelation A 
first stage the analysis of these grwwpmgs isolated all gtowps of 
variables with a minimum correlation between every member of ± 500 
Groups isolated m this way ranged m size from 2 to 7 variables and 31 
of the 38 variables fell into one group or another When the minimum 
level of intercorrelation was raised to ± 600, the largest group con- 
tained Only 4 variables, but 23 of the variables still fell in one group or 
another 

Although many of the groupings were the result of purely technical 
correlations, a particular set of variables occurred m several of the 
groupings which could not be explained in this way At the minimum 
level of r = i 500, the following variables occurred together with a 
minimum intercorrelation of — 523 
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yartabk Np 
2 
9 

11 

12 
24 
32 

38 


l^scnptfan 

Per cent population aged under 15 fnec ■) 

Per cent single adults 
Per cent one-person households 
Per cent one- and two person households 
Per cent households renting furmshed accommodation 
Per cent Iwn outside the Bntish Isles and the Mew Com- 
monwealth 

Per cent early school leavers (neg > 


At the natimam kvc! otr^ ±m. <.anahks 9 , 1 1 24 , and 32 stiti 
occurred in one group 

A more rigorous grouping procedure, analogous to the procedure 
outlined later for classifying couineration districts, divided all lanables 
ssithacorrelation larger than i SOOintoanumberofexcIusivescts.each 
variable occurring in only one of the sets Thirty-one variables «ere 
classified into sesen groups sucb that ibe average conelatton tiflhm 
groups was a maximum and that between groups a minitnutn TTte 
results of this grouping procedure indicated that the most mporiant 
genera! axes of difTerentiation between the enumeration districts of 
Inner London were 


«J Socioeconomic stattis. variables 3S, 40, and 41 
b Sharing (of dwellings and household facilities), variables J8, 19, 
20, 21, and 30 

c. Room density, variables 14, 15, 16. 22, and 36 
d 7>flnsie;ice, variables 2, 9, U, 12, 13, 23, 24, 26, 28, 32, and 37 
Three minor factors were (1) fertility, vanablcs 1, 5, 8, (2) the sex 
ratios, vanablcs 6, 7, (3) and two indices or isolated old people, 
variables 3 and 27 The variables not included in any group were vari- 
ables 1 0.1 7. 29.34. 35. 39. 42 

Another stage involved a principal component analysis of the cor- 
relation matrix The primary aim of the component analysis in this 
instance was to reduce the computational task of the classification of 
enumeration districts at the next stage, the first 14 scores of the 38 
possible pnncipal components accounting m all for 84 per cent of the 
onginal variance, were used as input for the classification of the 4,570 
districts of Inner London m place of the 3S original variable scores, 
analyzed in the correlation matrix Although the principal components 
were regarded primarily as a computational convenience, they also 
gave an indication of the most important dimehsions underlying the 
variation between enumeration districts The list of variables most 
highly correlated with the first principal component included most of 
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the \ambfcs associaud v.iih socioeconomic status and “transient ’ 
(sec the sarmMcs listed caihtt), the first component accounted lor 
exactli one quarter of the total sariance Variables most highly cor- 
related nith the second component v.erc the variables of sharing and of 
room density and accounted for a further 19 per cent of the total 
variance The variables most highly correlated vvith the first two pnn- 
cipal components are listed here m rank order 


Component I Variables Shomng the JJighest Correlations 
hi/A Component / m Rank Order 


Variable No 
9 
2 
12 
38 
11 
32 

24 

26 

41 

40 


Description 

Per cent single adults 
Per cent aged under 15 
Per cent one and tuo-person households 
Per cent early school leavers 
Per cent one person households 
Per cent bom **elscwhcre” outside Bnnsh Isles and 
New Commonwealth 

Percent pnvatetenanls./irrmiArdaccommodation 
Percentone personhouscholdswithoneoldperson 
Per cent males tn manual occupations 
Per cent mates m professional and managerial 
occupations 


- 839 
836 

- 822 
816 

- 809 

- 794 
-J83 

709 

698 

- 653 


Component II Variables SfiOHing the Highest Correlations 
iMih Component II in Rank Order 


Variable No 
15 
20 
19 

21 

14 

16 

18 

1 

30 


Desenphon 

Per cent ofall households overcrowded —845 

Per cent households shanng W C -* 730 

Per cent households shanng dwelling, stov e, and 
Sink - 679 

Per cent households shanng dwelling, stove, sink, 
andWC - 678 

Per cent five- or more person households 
overcrowded — 650 

Tvo of persons per room — 629 

Per cent households shanng dwelling ■" 628 

Per cent aged under 5 — 626 

T*er «nt bom in Bntuh Canbbean - 601 


The results of these analyses arc not entirely conclusive The prin- 
cipal component analysis suggests that the two most general factors of 
difTcrentialion between distncts are an index of “soaal status ’ including 
the elements both of social class and of mobility or "transience,” 
second, a general index of housing conditions including the elements 
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both ofsharingand of room density *®Simp!eranaljstsofthecorrcialion 
matrix suggests that these two general axes arc better divided into four 
Ihe most interesting feature however. « the indication that some 
measure of the mobility of the population both actual (in terms of 
length of residence and birthplace) and potential (m terms of marital 
status, household si 2 e, and t)pe of accommodation), u an essentnf 
differentiating factor between districts of Inner London, although we 
do not yet know whether this is so in other British cities 


PfJASE 3 A NEW TYPOLOGY OF LONDONS 
DISTRICTS— METHOD AND RESVLTS 

The two phases of the analysts described were, in many wa)’s only 
essential preliminaries to the construction of a typology of London's 
subareas, the features of individual variables and of their mtcrrclation- 
ships which have been described were chosen primarily to illustrate the 
nature of (he typological problem In this section I shall describe the 
method and results of a classification of the enumeration districts of 
Inner London and the wards of Greater London into a varying number 
of groups The method of classification and the algorithm by which il 
proceeds were developed by Nigel Howard for this research project (see 
Chapter 16 of this volume), my description of the method summarizes 
the mam points of hts full miihcmatical exposition to be published 
later From the methodological point of view, the classification of 
enumeration districts may be regarded as a particular application of a 
generally applicable method of classification 
A number of previous studies have provided a typology of areas 
within London Charles Booth s survey Li/e ami Labour of the Peopkof 
London produced a senes of maps m which all blocks of buildings in the 
streets of Inner London were allocated to a set of categories according 
to the social status and income of their inhabitants *' Similar maps 
were produced by the invesli^iors of The Acvi Sur\e\ of London Life 
and Labour in the 1930 s " A general typology of the wards of the 
Greater London convrbstian based <m census data was produced by 
the Census Otlice in the 1951 report on the conurbations, but the 


A vanmax facioranalysnoriheeemrtaiioninatm for 
by Miss n.rabcih G.uus as part ofa 

Inter Universities Census Tract C<>tiimrtK« 6«es results similar lo ihe nsultv of 
principal component analysis 


* * Booth. Dp cir 
“ Scrtiih, op cu 
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typology did not follow a rigorous method A typology restricted to 
administrative areas as the basic unit is provided in a statement of 
evidence to the Royal Commission on Local Government in Greater 
London, presented by the Centre for Urban Studies The typology 
reported here departs from its predecessors not only in the use of a new 
form of data but also in the rigor and flexibility of the results The 
method is a development of that used by Moser and Scott in their 
classification of British towns “ 

In general, the desirable properties of any classification arc that the 
groups identified should be as homogeneous as possible and that they 
should be as distinct as possible in respect to all the variables or attri- 
butes that are considered relevant to the classification The problem of 
classification is not fully defined without, first, a specification of the 
number of groups to be established and, second, the definition of some 
measure of the homogeneity of the results In the classification pro- 
cedure described here, results were produced for a varying number of 
groups, ranging from two to thirty, this allowed us to inspect the re- 
sults and choose the most desirable for any particular purpose 
The measure of homogeneity derives from the analysis of variance, 
for any individual group of any classification, the measure of its 
tnhomogeneity is defined as the sum of the squared distances of every 
element (in this case every enumeration district) from the mean of its 
group Where, as in this case, more than one variable is used, squared 
distances for each element are summed over all variables used in the 
classification Comparability between difTcrcnt measures is achieved by 
standardizing all scores in units of standard deviation from their 
original means, this gives each variable a standardized mean of zero 
and a variance of one ** 

The inhomogeneity of an entire classification is measured by the 
summation of the inhomogeneity of the individual groups over all 
groups Where, as in this case, all variables are measured as deviations 


'* General Register OfTice 1951 Census, Report on Greater London and Five Other 
Conurbations (London H M S O 1936) 

Westergaard The Structure of Greater London in London — 
Centre for Urban Studies (London MacGibbon and 

^ ° Statistical Study of Their Social and 

and Boyd Studies Report No 2 (Edinburgh Oliver 

way also simplifies the mlcrprctat.on of 
n^mN-rof^nH !i variation of the popuUtion m standard form, is equal to the 
cUssificalL « ciiicdtut' *>» a' "umber of variables by which the 
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from (hccr respective means in the entire populalion, the inhomoeeneitv 
represents the “ivithin group sanation” in the analysis of-variancc 
equation 


Total vanation = withm-group variation + between-group variation 
The complementary item of the total vanation, that between groups, 
ma> therefore be regarded as a measure of the homogeneity of the entire 
classification 

When the homogeneity of a classification is measured m this way, the 
problem of finding an “optimum*’ classification is that of delimiting 
groups so that their total within group variation is minimal, when this 
clement of the total vanation is minimized, (he variation between groups 
is maximized The computer program for finding an optimum dasst* 
fication works from the principle that a classification that has maximum 
homogeneity, as defined earlier, also has the property that each clement 
in the population classified is closer to the mean of its own group than 
to the mean of any other group, the program wj)] improve any given 
classification until it has this property The algorithm starts with agiven 
classification into P groups and computes the mean vector of each 
group The squared distances of each element from the means of each 
group summed over alt the variables arc compared, if the squared dis- 
tance of any element from the mean of its own group summed over all 
the variables is greater than, or equal to, its squared distance from the 
mean of anv other group, the element is transferred to the group whose 
mean is closest to it New means are then computed for the groups 
involved m any transfers, and a new set of comparisons is made The 
process continues until a test shows (bat every element is closer to its 
own group mean than to the mean of any other group 
The classification achieved by ihis algorithm is a “local” optimum m 
the sense that the homogeneity could not be improved by the transferor 
any single clement from one group to another It is not necessarily a 
“global” optimum in the sense that the homogeneity might not be in- 
creased by a different initial starting classification In any succession of 
classifications it is therefore important that the algorithm should start 

with a “good” initial partition of the population In practice the com- 
puter starts the process bysubdividinglhepopulatioaintothoseelemenJs 
which he above and those which lie below the mean of the variable that 
has the greatest variance (In the classification of enumeration distncts 
of Inner London, the input data were the first 14 pnncipal component 
scores derived from the 38 vanables of the correlation matrix The 
inmal partition of the elements was therefore into the group lying 
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above and the group lying below the mean of the first principal com- 
ponent — the component with the greatest variance ) This initial parti- 
tion IS improved until it is optimal The algorithm proceeds from two to 
three groups by similarly dichotomizing the group that has the greatest 
internal variation (the greatest inhomogencity). and the procedure con- 
tinues until a sufficiently large number ofclassificationsoraclassification 

into a sufficiently large number of groups has been obtained For the 
districts of Inner London, results were produced for classifications into 
2 groups, 3 groups, and so on» to 30 groups 
The output of the computer program consists of the following items 

1 An allocation matrix in which each clement, identified by a unique 
number, is listed with the identification number of the group to which 
that element is allotted in each classification 

2 The number of elements allotted to each group in each classification 

3 The mean vector of each group in each classification, specified 
m standard scores above or below the mean of the entire population 

4 The total variation within each group of each classification (the 
inhomogeneily) 

5 The total variation within each group of each classification sepa- 
rately for each of the variables used in the classification 

6 The proportion of total variation which is taken up by vanation 
bet\^een the groups of the classification, for each classification (the 
homogeneity) 

In addition to this matenal, a matrix summarizing the relationships 
between all the groups of successive classifications shows (1) the 
identification numbers of the groups in all other classifications with 
which each group in any given classification has the largest number of 
elements in common, (2) a count of the number of elements that do not 
follow the general pattern of interrelationship between classifications 
that IS shown by the majority of elements 
From this output the results of any classification can be mapped and 
the groupings interpreted Obviously, for a project involving the mani- 
fold classification of 4,570 enumeration districts, not every classification 
could be separately mapped and interpreted in full Technically, the 
best, but trivial, classification would be one m which every element was 
m a group on its own (m which case the total vanation would be en 
tirely accounted for by variation between groups) In practice, the 
choice of a classification for further analysis will depend on the par- 
ticular needs of any inquiry and intuitive preference for a particular 
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number of groups [Jott^ver wiilim a given ungc of possible ebssi 
JiMUons ihcre ire a number of stawtiul crjleria ihat indicate which 
dassification »$ preferable to others m {he range In the analysis of 
Inner London classificatiom into two and si* groups were selected for 
most detailed inspection The purely staiistml criteria for this choice 
ma) he applied to any lypsologv derned from similar data but specific 
intuitive criteria arc rcles ant to this particular siud> 

The iMofold classification ssa* chosen primarily for the intrinsic 
interest ofa simple dichotomy — to lest fnreximple whether the terms 
fast Tnd West Lnd or Center and Suburb had any meaning 
The mfold classification insoived more complex criteria The first 
criterion involves comparison of ihc mersiirt of homogeneity of any 
ctasvificniion with lh*t of its immediate neighbors As the proportion 
of total sanation recounted for by sarniion between groups is bound to 
increase with an increase in the number of groups m the classification 
thchomopcneitj increases with successive classifications Thcdifierences 
betsveen the homogeneit) of each classification and the next show how 
much IS to be gained b> going from one classification to the next There 
fore Within a gisen range a classification is considered to be a good 
one if the grm m selecting it rather than the previous classification is 
greater than the gam in forsaking it for the classificriion that follows 
Classifications for which (his is true are those where the second dif 
ferenccs tn the origiml seijucnce (of proportions of total variation 
which 18 variation between groups) arc hrge with negative sign B> this 
criterion the two» three four and sixfold classifivations vsere the 


best in ihil order However since the second differences also showed a 
definite tcndcnc) to incrcisc with the number of groups m the classi 
fication T similar argument was applied Ircaiing the second difTcrcnces 
as the original sequence of metsurcs this led to inspection of the fourth 
differences which indicated that the sixfold classification was the most 


satisfaciory 

A second important criterion is the mhomogeneity of each of the 
scpir itc groups of a chssification The total variation within any group 
wfitth is part of the prsigram autpat ts obviously affected by ibe number 
of districts in the group Hut a measure of homogeneity which is in 
dependent of the number in the group is the variance of the group 
obtained by dividing (he varntion by the number of elements m the 
group The virianee of each group «n then be compared with the over 
all variance of the population and groups with particularly large 
variances excluded from further analysis By this criterion we set an 
upper limit of nine groups, all classifications into ten groups or more 
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included groups with particularly small numbers of districts and with 
variances as high as four times the variance of the total population 
A third cnlerion related the results of successive classifications to the 
analysis of the most important axes of dificrcntiation which were 
established in Phase 2 The over all measure of homogeneity, the total 
beiwecn-group variation, can be broken down into the between-group 
variation of each individual variable The variables that best distinguish 
between all the groups are those for which this clement of variation 
IS the largest For all the classifications from the twofold to the sixfold, 
the most important differentiating variables were those which defined 
the four basic axes of differentiation outlined earlier under Phase 2, 
these were Socioeconomic Status. Sharing, Room Density, and 
Transience For the sevenfold and subsequent classifications, relatively 
unimportant differentiating variables, such as the proportion of the 
population bom in Cyprus or m Malta and the ratio of dwellings to 
buildings, began to play an important distinguishing role A sixfold 
classification of the enumeration districts of Inner London was there 
fore the most detailed classification we could achieve, given the method 
and input data considered here, which difTcicntiated only according to 
important general factors of difference 
The classification procedure is based on purely logical considerations 
There is no guarantee that the groups of any of the classifications will be 
meaningful, either geographically or sociologically, since the process is 
neutral to both geographical and sociological considerations In this 
case, the classifications analyzed were meaningful in both respects A 
summary of some salient features of the twofold and the sixfold 
classifications of enumeration districts and of the sixfold classification 
of all the wards of Greater London occupies the rest of this paper 
Despite the geographical complexity emphasized earlier, the twofold 
classification of enumeration districts produced a relatively simple 
geographical pattern A central groupof 855 distncts was identified in the 
North-West quadrant of Inner London The group was geographically 
compact and extended to the boundary of the old Administrative 
County in the North Apart from this central block there were a few 
isolated districts located, in most cases, close to the fashionable 
commons of South London 

This simple dichotomy, concentric m pattern, isolated the cosmo- 
politan center from the rest In the central block occupational status 
was high and the proportion of early school leavers particularly low — 
40 per cent on the average, compared with 80 per cent in the rest of 
Inner London The proportions of single adults and of small households 
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"ere large, and a relat.iely large proportion of the popnial.on was born 
outs.de the British Isles and the New Commonwealth (18 per cent) 
Despite the high status of the center, however, there was little dilTerence 
between the two sectors m housing conditions 
Subsequent classifications heightened the detail of London’s sub- 
division, but this central block remained until the classification into six 
groups In the sixfold classification it was split up (although not in a 
straightforward way) into a smaller central core and a fringe of dis- 
tricts in which the element of “transience” was much stronger The 
high-siatus districts of the central core were named, for shorthand 
purposes, the “Upper Class” type (Type 1) 

The central core was surrounded by an irregijiar band of distncts in 
which almost half of all adults were single and 45 per cent of households 
consisted of only one person Forty-two per cent of households had 
furnished accommodations, cnobtUty was hrgh. and the proportion of 
early school leavers was relatively small These characteristics, although 
less strongly marked, were also distinguishing characteristics of the 
central core, but in this peripheral type, housing conditions were much 
worse— m particular, there was far more sharing of dwellings and 
facilities Because of the pattern of characteristics which these 
distncts showed on average, th^ were named the ‘ Bed-Sitter" type 
(Type 2) 

Geographically, districts of the "Bed-Sitter” type had a clear con- 
centric relationship to the distncts of the central core, they rarely 
occurred in isolation from the "Upper Class ’ type, most of them lying 
immediately adjacent to the central core 
Although none of the remaining types of districts m Inner London 
had such a strong concentric pattern, the third type described did tend 
to occur at roughly the same distance from the Central group m each 
direction This was the “Poor” type of district (Tjpe 3) found in the 
worst areas of North Kensington. Paddm^on, Islington Whitechapel, 
and Brixton The distinguishing features of this tjpe were large pro- 
portions of households in overcrowded dwellings and sharing domestic 
facilities, low social status, and relatively large proportions of children 
under five —amounting to 10 per cent of the population This type of 
district included the mam areas of West Indian. Cypriot, and Irish 
immigration , . . „ , . r 

Compared with the pattern of prewar slums, the Poor type of 
district identified in the sixfold classification occurred m relatively small 
patches in Inner London An cjlcnsive block of blishted housing "h.ch 
previously ran throush the East End and nverside boronghs had, m 
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1961, been broken up by the slum-clearance efTorts of local housing 
authorities and the effects of bombing during the last v-ar. Few very 
extensive patches of blight remained, apart from Whitechapel, among 
the two types of district which then accounted for most of the old slum 
areas The “traditional” slum areas were mostly made up of districts 
that, m terms of our typology, may be called the “Stable Working 
Class” type and the “Local Authority Housing” type (Types 4 and 5, 
respectively) 

The “Stable Working Class” type was the largest group of districts 
m the sixfold classification (34 per cent) and may be regarded as the 
“average” type for Inner London; on none of the variables was this 
type clearly distinguished from the mean for Inner London as a whole 
However, the average distnct for Inner London was one in which 65 per 
cent of the population were in manual occupations and three quarters 
left school before the age of sixteen, almost one third of households 
shared a dwelling, and one third shared a W.C This t>'pe of district 
was named “Stable Working Class” because, although housing con- 
ditions and social status were comparable to those of the “Poor” type 
described earlier, the immigrant and recently mobile population was 
much smaller It is important here to note the effect of defining all 
types m terms of their deviation from the London mean, the result is a 
typology that is not “absolute” because the allocation of individual 
distncls IS affected by changes in the size and mean characteristics of 
the classification universe If the districts of Inner London had been 
classified together with those of Outer London and ifthe mean character- 
istics of Greater London had been used as the reference, many more 
districts would have been classified as “Poor” where, in Inner London 
alone, they were merely “average ” 

The “Local Authority Housing” type (Type 5), contained 1 8 per cent 
of the districts of Inner London but showed no clear geographical con- 
figuration This IS not surprising in view of the piecemeal development 
of public housing and the freedom of public housing authorities from 
some, at least, of the normal market forces, the mam patches of this 
type of districtwerethe large counalcstates.mostly close totheboundary 
of Inner London A large proportion of children under fifteen, few small 
and many large households lived in these districts, although average 
room density was quite high, there was very little actual overcrowding, 
and sharing of dwellings and facilities was particularly uncommon 
The average charactcnstics of the “Local Authority Housing” type 
were apparently straightforward, but closer inspection of the districts 
revealed, m acute form, one of the difficulties of interpretation of this 
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type of multivariate classification The shorthand descnption, ‘‘Local 
Authority Housing,” was chosen because, on the a\crage, 75 per cent 
of households in any district rented their accommodation from a local 
authority , however, the range wasTrom under 10 per cent to 100 per cent 
within districts, m four out of five of the other types of district there 
were individual districts (hat had 80 per cent or more households m 
council tenancies This “overlapping” between types in the classification 
in terms of a single variable arises because every vanable is taken into 
consideration in the process of classification Because all the character- 
istics of the districts were considered m (he process at the same time, 
the classification did not produce results that had a very distinct mean- 
ing m terms of any one variable on its own In the case of “Local 
Authority Housing” a district having, in extreme form, all the character- 
istics normally associated with council housing such as a high propor- 
tion of children under fifteen and similar housing conditions, might 
have been classified as “Local Authority Housing" type even though 
there were no council tenants m it This argument applies equally to all 
the variables used 

The “Stable VVorking Class” type and the "Local Authoniy Housing’' 
type formed the fourth major band of districts in Inner London They 
were broken up in some areas by the “Poor” type described previously 
On the southern edge of Inner London they gave way to (he final type 
identified as the “Almost Suburban” iyj>e (Type 6) These districts had 
a relatively high proportion of owner-occupiers and above average 
occupational status and housing conditions 
The map (Figure I) shows the result of this sixfold classification of 
enumeration districts for the districts falling withm the Greater London 
Borough of Camden The intermixture of all types of districts shown 
on this map is fairly typical of Inner London as a whole I^etailed 
results for the whole of Inner London are to be published early in 1969 
m Volume I of the Third Survey of London Ufe and Labour 

The typology of Inner London’s enumeration districts, described 
earlier, was largely confirmed by a classification of all the electoral 
wards of Greater London Electoral wards were classified into from 
two to SIX groups using only seven of the original variables A sample 
analysis of the variation between all the enumeration dislncts of Greater 
London indicated that variation between wards would a^unt for 
about one half of the total vanaljon Seven of the original vanables 
were computed as input for the ward dassrfication — (he variables that 
best distinguished between the first few classifications of the enumera- 
tion districts of Inner London These were. 
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DescripUort 

Per cent aged under 15 
Per cent single adults 
Per cent one- or two-person households 
Per cent of all households overcrowded 
Per cent of households sharing a dMcIling 
Percent early sdiool leavers 
Per cent males m unskilled manual occupations 

Although the difference between means of Inner London and Greater 
London naturally affected comparison between the classifications the 
pattern in Inner London described previously was largely repealed 
The centra! “cosmopolitan** block again occurred but with no dis- 
tinction between the central cote and the peripheral “Bed-Sitter” 
fringe, instead, the suburban areas of Outer London were divided info 
two types, a “Higher Status Suburban” type and a “Lower Status 
Suburban” type 


Variable No 
2 
9 
12 
15 
18 
38 
41B 


CONCLUSION 

The data and the classification procedures used in this project do lead 
to a typology of London’s subareas which isintuttivcly satisfying— the 
complexity of London is reduced to a meaningfuJ pattern But a true 
appreciation of this complexity is itself one of the rewards of the 
analysts Stratification of areascannof be satisfactorily achieved wifhoui 
recognizing that housing conditions and “transience,” as well as occupa- 
tion and schooling, play animportantdi/Terentiatingrole Onceachievcd, 
an optimum classification, which recogmzes differences m all these 
respects and more, docs not produce a geographically ample result 
Although in the preceding summary, the location of different types of 
district has been described mainly in terms of the occurrence of sub- 
stantial areas ofa similar type, some parts ofLondon are better described 
in terms of their heterogeneity and the mixture of all types of district 
wiihm close proximity to each other This is one of the aspects of 
London which, although apparent to the pedestnan on London's 
streets, has not previously been measurable oa the large scale attempted 
here 
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Variable 

No 

1 

2 

3 

4 


5 


6 

7 

8 

9 


10 

11 

12 . 

13 

14 


15 


16 

17 

18 

19 

20 
21 

22 . 

23 

24 


Appendix B 

Variable Numbers avd Titles 


Variable Title 


Percentage of the population aged under 5 
Percentage of the population aged under 15 
Percentage of the population aged 65 or over 
Percentage of the population of pemionable age 
[(Enumerated males aged 65 or over and females aged 60 
or over — enumerated population all ages) x 100 0) 
Fertility ratio 

[(Enumerated population aged 0-4 — enumerated females 
aged 15-44) x 1,000) 

Femalc/male ratio all ages 
Female/male ratio ages 25-44 

Percentage of women aged 20-24 who have ever been 
married 

Percentage of adult population that arc single 
[(Enumerated population aged 15 or over that are single — 
total enumerated population aged 15 or over) x 100 0) 
Number of dwellings per building 
Percentage of households containing one person 
Percentage of households containing one or two persons 
Percentage of households containing fiv e or more persons 
Percentage of households containing five or more persons 
which are overcrowded 

[(Households present at census containing five or more 
persons who live at a residential density of over 1 J persons 
per room — total households present at census containing 
five or more persons) x 100 0) 

Percentage of all households that are overcrowded 
[(Households puesent at census with a residential density of 
over 11 persons per room — total households present at 
census) x 1000) 

Average number of persons per room 
Average number of rooms per dwellmg 
Percentage of households sharing their dwelling 
Percentage of households sharing their dwelling, stove, and 
sink 

Percentage of households shanng or entirely without a WC 
Percentage of households shanng their dwelling, stove, sink, 
and shanng or without a W C 
Percentage of households owming their accommodation 
Percentage of households renting their accommodation from 
a Local Autbonty 

Percentage of households renting pnvate furnished accom- 
modation 
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Variable 

No Vanabfe Title 


25 

26 

27 

28 

29 

30 

31 


32 


33 

34 

35 

36 

37 

38 

39 

40 


Perccfifage of AousehoMs renting private unfaraishcd 
accommodafton 

Percentase of one person households coniajHins ooe old 
person 

Percentage of two-person households contaimng two old 
people 

Percentage of the resident population bom in India, Pakistan, 
or Ceylon 

Percental of the resident population bom m British Africa 
Percentage of the resident popalation bom in the British 
Caribbean 

Percentage of the resident population bom m the New 
Commonwealth (sum of variables 28, 29, and 30) 

((Resident population stating their birthplace as India. 
Pakistan. Ceylon, British Africa, or the British Caribbean 
- tola! papula tlo0 stating residence m England and Wales, 
or not statine usual residence) x 100 0] 

Percentage of the resident population born elsewhere outside 
the British Isles and the New Commonwealth 
((Resident populafion stating binhplaee as anywhere other 
than Great Britain. Ireland India, Pakistan, Ceylon, 
British Africa, or the British Caribbean - total population 
ttatuig residence lo England and Wales, or not stating a 
usual residence) x 100 OJ 

Percentage of the resident population born anywhere outside 
(he British Isles (sum of variables 28, 29, 3t?, and 32) 
Perrentage of the resident population bom in Malta (in- 
cluded in variables 32 and 33) 

Percentage of the resident population bom in Cyprus (in 
eluded ;n vambies 32 and 33) 

Percentage of the resident population bom in Ireland (both 
parts) 

Percentage of the population that moved into the local 
authority area within IZ months of census dale 
Percentage of the population aged 15 or over that left school 
before the age 16 

Percentage of the economically active population that are 
female 

Percentage of occupied and retired males whoareemp/oyers 
or in professional or managerial occupations 
((Occupied and retired males in socioeconomic groups 1, 2, 

3 4 and 13 - total occupied and retired malts in all 
socioeconomic groups) x 1000] 
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Variable 

No 


Variable Title 


41 Percentage of occupied and retired males in manual occupa- 

tions 

[(Occupied and retired males in socioeconomic groups 7-12 
and 14-17 — total occupied and retired males in all 
socioeconomic groups) y 100 0] 

42 Percentage of the economically active population working 

outside their local authority area of residence 


Additiosal Variables Not Used is the Classihcation 
OF Esumeratios Districts 


Variable 

No 


Variable Title 


40A Percentage of occupied and retired males who are employers 

or in managerial occupations 

((Occupied and retired males in socioeconomic groups 1, 2, 
and 13 — total occupied and retired males in all soao- 
economic groups) x 100 0) 

40B Percentage of occupied and retired males m professional 

occupations 

({Occupied and retired males m socioeconomic groups 3 
and 4 — total occupied and retired males in all soao- 
economic groups) x 100 0] 

41A Pcrcenuge of occupied and retired males in skilled manual 

occupations 

((Occupied and retired males in socioeconomic groups 8, 9, 
12, and 14 — total occupied and retired males in all soao- 
cconomic groups) X 100 0] 

^1® Percentage of occupied and retired males m unskilled manual 

occupations 

[(Occupied and retired males in socioeconomic groups 7, 
10, II, 15, 16, 17 — total occupied and retired males in all 
socioeconomic groups) x 100 0] 

Percentage of occupied and retired males in intermediate and 
junior nonmanual occupations 

[(Occupied and retired males in socioeconomic groups 5 and 
6 — total occupied and retired males in all socioeconomic 
groups) y 1000] 

44 Percentage change in population 1951-1961 
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Least Squares Classification and 
Principal Component Analysis; A Comparison* 

I geometrical comparison 

Tiinii: IS A FUNDAMENTAL ANALOGY betuEcn the method by which the 
4 500 enumeration districts of London Administrative County were 
classified into groups and the method of principal component analysis 
bv which the principal factors of variation among the districts were 
discovered To bring out this analogy I shall discuss these two methods 

Th^data for both methods were the values of about 40 census 
vnnablcs for each of about 4,500 districts These data may be displayed 
m a table, or matris, with 4 500 columns and 40 rows The number 
say in the ith row and jth column IS the value of the ith variable for 

"’Vhc S«t‘of principal component analysis is to sumraanae these 
rlnta in a matrix with the same numter of columns but fewer rows 
That IS a smaller number of new variables G sough, to express as much 
L possible of the diversity between districts expressed by the original 

''“fe'object of teas, squares classification ,s to summariae the data 

Tavsioclc I96S^ 

Method for Cluster 
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in a matrix v.iih fev.er columns but the same number of rows. Let us 
regard the original data as gi\>nglhe means of 4,500 groups of districts, 
each containing one districL Then we can say that we arc seeking a 
smaller number of new’ groups, the means of which will express as much 
as possible of the diversity between the means of the ori^nal groups. 

How this is done in each case may be illustrated geometrically — at 
any rate, in the two-dimensional case. Suppose that there arc only two 
variables, perhaps a social-class index and a housing index, and only 16 
districts. Then ^ch district may, as in Figures I and 2, be plotted as a 
point in two-dimensional spacc. 

Figure 1 illustrates the finding of the first principal component, 
that is, a single new- %-ariable in terms of which as much as possible of 
the original disersity between districts may be expressed. The new 
variable is represented by a line such that the sum of squares of distances 
from the points to the line is a minimum. In the figure, the sum of 
squares of lengths of the arrow-s drawn is minimized. 

Figure 2 illustrates the partitioning of the population into two groups 
using least squares classification. The two groups are chosen so as to 
minimize the sum of squares of distances from any point to the mean 
of its group. Thus, again in Figure 2, where the means of groups are 
represented by crosses, the sum of squares of lengths of the arrows is 
minimized. 

In each case the data are then summarized by gj>ing for each district, 
not its actual position in space, but the position of the arrowhead 
drawn from iL In Figure I this makes it possible to characterize the 



figure / 
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Figure 2 

dfsinct by a value of just one vanable — the position of the arrowhead 
on the line In Figure 2, instead of sixteen positions in space, it becomes 
possible to give just two — (he positions of (he means of the groups 
These figures illustrate (he problem m two dimensions In m dimen* 
sions a principal component analysis seeks forp (< m) lines which are 
at right angles (o each other and thus define a p-dimensiona! subspace 
of the wj-dimcnsjonal space in such a way ihat the sum of squares of 
(he m-dimensional distances through which the points have to move 
when projected into the subspacc is a minimum This cannot be iKus 
trated in two dimensions 

A P'fold classification, on the other hand, can be illustrated in two 
dimensions, though it should be imagined as taking place in m dimen- 
sions A partition of ti points into p (< n) ^oups is sought in which 
the sum of squares of all /»-dimensionaJ distances from the points to 
the means of iheir groups is a minimum 
It can be seen that the results will depend on the units m which the 
variables are measured along the axes In fact, neither a principal com 
ponent analysis nor a classification can be earned out without choosing 
not only a set of variables but also the units m which those variables 
are to be measured , and these choices must be made a priori 
This problem frequently anses m roultivanatc statistics, and it is 
often resolved by the decision to use standardized variables, which are 
measured in units of standard deviations from their means This was 
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what we decided to do What should be emphasized is that this choice 
had to be made on intuitive grounds, a priori It is natural that, in 
choosing the variables to be used, we should define the field of study 
in which we are interested But if mathematical methods are to be used, 
we may have to make that definition very explicit In particular, for 
component analysis or classification the definition must specify units 
for the variables chosen 

2 MINIMIZATION OF VARIATION LOST 

Defining the iflf/a/ioH o/ a mulitxanable population — one each of 
whose elements is described by the values of m variables — as the sum 
of the variations of all the variables, 

"•S’ = Z Z (*« - 

l-» /-I 

(where is the mean of the ith variable), we can say that both principal 
component analysis and least squares classification replace the original 
population by another that has less variation, and both seek to mini* 
mize the amount of the original variation that gets lost in the process 
Geometrically, the variation that gets lost in each case is the sum of 
squares of the arrows illustrated in Figures I and 2, the variation 
retained is the variation of the population that replaces the original 
one, and that is the population of the points at the tips of arrows in 
Figures 1 and 2 In Figure 1 it is a population of n points on the line 
of the principal component, in Figure 2 it is a population of n points 
at the means of the two groups 
To verify these statements, we show that a general formula 
Total variation = Variation lost 4- variation retained 

does, in fact, hold in each case, and we derive the particular forms in 
which It holds Then, working from thcparticularformula for each case, 
ue show how the principal component problem and the classification 
problem are each solved by seeking to minimize the variation lost or, 
equivalently, to maximize the variation retained 
Clearly, there is a trivial solution for each problem by which all the 
original variation would be retained If m axes at right angles or n 
groups of elements each containing one clement were chosen (n being 
the original number of elements) then the variation lost would be zero 
Moreover, the vanation retained would in both cases increase (or, at 
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■^11 ^12 ^\p 

^21 ^22 ^2 p 

{%} = 

.JC«1 X„^ 

in which the jth column gives the mean values of each of the original 
variables for the jth group, this matrix does not, m general, have any 
columns in common with the matrix of p p) columns which is given 

by the classification into p groups 
The same phenomenon naturally appears if we introduce and com 
pare the other two matrices given by the two solutions The other 
matrix found by component analysis has p rows and m columns and 
gives the general definitions of all the p new variables in terms of the 
original m variables (The /th such definition is of the form 


X/ — fljiXj -h 0,2X2 + + ^ 

so that, m fact, the /th new variable is a linear function of the m original 
ones From this definition we may deduce each of the values m the /th 
row of the matrix (xj/} given the matrix {x,,} which comprises the 
original data) 

The matrix of definitions of all the p new variables is written 



In the /th row this gives the coefficients occurring in the definition of 
the iih new variable In this matrix, too many solutions are compre 
hcndcd That is the first / rows give the general definitions of the / new 
variables found when only / new variables are sought 
Contrast this matrix with the one obtained by a solution to the p fold 
classification problem This is a matrix with n rows and p columns in 
which the clement S„ occurnng m the ilh row and /th column is 
defined by 


5 _ f I if the /th point is in the _/th group 
\ 0 if not 
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Thus the ith column gives the membership of the jthpoup and m 
leneral It 'v.ll not nLn the same as p is varied Thus again -e 
cannot neatly summarize many solutions in one ‘’Yf ' 

the matrix (xul can be deduced from {S„) together tilth {x „) ) 
T“ofter d sSvantage suffered by a classification solution is at 

SSSSiilSI 
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as possible of the original variation — provides a general yardstick of 
“merit ’ for the objects It is obviously justifiable to use it as such ifthc 
proportion of the variation it absorbs is high If it is not, the alternative 
of classifying the objects into groups whose members score highly in 
different variables, without attempting to impose a general jardstick, 
might be better 


4 DVAUTY RELATIONS 

In Section 5 I shall show algebraically that there is a relationship of 
duality between least squares classification and a certain kind of 
restricted component analysis It transpires that a least squares classifi- 
cation is simply a component analysis performed on the transposed 
matrix of “observations” instead of the matrix X = {xi))> 

except that certain side conditions are added to this component problem 
That IS, the process of classification may be described as follows 
Instead of regarding Xtj as representing the value of the /th variable 
for the jth element, we regard it as representing the value of a fictitious 
jth variable for a fictitious ith element Then we perform a component 
analysis subject to these additional conditions 

1 Each fictitious variable shall have a nonzero coefficient in the 
definition of one and only one component 

2 The nonzero coefficients m the definition of a given component 
shall all be equal 

3 No component shall have only zero coefficients m its definition 
(This assumes that the number p of components is fixed beforehand) 

Conditions I and 3 correspond to the condition in the(nontransposed) 
classification problem that the elements be partitioned into p subsets 
Indeed, let us recall the definition of a partition of a set S’ as a set of 
subsets of 5 such that 

I Any two subsets are disjoint 

II Together the subsets add up to S 

III No subset IS empty 

Thus we can easily see that Conditions I and 3 require the component 
analysis to determine a partition of the set of m variables into p subsets, 
the gth subset being the set of variables with nonzero coefficients in the 
definition of the ^th component 

Condition 3 is, of course, unnecessary in the sense that, if a certain 
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amounf of varialion can be retained using less than p subsets, at least 
the same amount can ah\a>s be retained using;? I put it in for complete- 
ness, because it occurs m the definition of a partition 
Condition 2 corresponds »n the classification problem, to the fact 
that, of two elements m a group onccannot be "more” in than the other 
Each IS either m or out In the component problem it combines with 
Condition 1 to ensure that the problem of fmdmg just the firrt compo- 
nent (solving the problem for p == I) has only one trivial solution, 
this corresponds to the fact that there is only one trivia! one-partition 
of a set of elements 


Condition J ensures that the axes of the p components found will be 
orthogonal But, in fact, tl says more It says that any axis must be 
orthogonal to the axes of all the original variables except that which 
has a nomero coefTicieni w its dcRnitton 


TIic restncieif component problem having these side conditions, is 
not of course as simple to soKe as the ordinary unrestricted one In 
fact, Coriditions I and 2 are essentially logical “either or" conditions, 
thus the problem is not amenable to ordinary calculus methods This 
is why the classification problem, unlike the unrestricted component 
problem, can be solved by enumerative methods but cannot otherwise 
be solved in such a way as to guarantee global optimality However, i 
shall prove a necessary condition for an optimal solution, and this can 
be enforced by iterative methods 

On the other hand the components obtained by a restricted com- 
ponent analysis arc defined in a much simpler way than those obtained 
by an unrestricted analysis And this is perhaps the basic reason why 


classifications are easier to interpret 
Finally, these relationships of duality enable us to perform an 
"actual" restricted component analysis by performing a 'RctiUous" 
classification on the transposed matrix The point of this would be to 
find components that are more easily interpretablc than the ordinary 


principal components 

(Such a form of component analysis might seem to be of limited 
applicability because of Condition 2 which requires not only that the 
nonzero coefficients be equal m magnitude but also that they have the 
same sign This, as it stands, means that a vanabJe that would fit very 
well into a certain subset of variables is excluded simply because of its 
sign Tn fact however, there is no compuialional difficulty m simply 
disregarding ’the signs of variables, although here I shall not d.si^ard 
signs, because to do so would obscure the duality relationships 1 am 
discussing ) 
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5 ALGEBRAIC DISCUSSION OF DUALITY 

Suppose we are given a set of n elements each cliaraclcnzcd by the 
values of m variables The data may be represented in an m A n matrix 

A' = {x„) 

as noted before I shall write for the jih column vector of this matrix 
— the vector of values characterizing the jth element — and xj for the 
ith row vector — the vector of values of the ilh variable Thus 



(The superscript ^ is used to indicate transposition ) 

Also I write 

for the vector of means of rows and 

for the vector of means of columns Of course, | has a statistical and 
intuitive meaning as the vector of means of the variables, whereas X 
has not much meaning 

Consider first the principal component problem 1 shall discuss this 
briefly, as it has often been discussed elsewhere The problem is to 
find a matrix I' = , v^) and a vector h such that 

X|(f- A)" -/Ol'l 

is a minimum subject to V^V = / Here the vectors v are any set of 
orthogonal unit vectors determining a p dimensional subspace, h is the 
origin of the subspace, and the notation v*, where v is a vector, is used 
for brevity to denote the inner product of v with itself 
The expression to be minimized is the sum of squares of arrows 
illustrated for the case p *= 1 m Figure I And when the matrix has 
een found, the p x n matrix X in which the n original m vectors are 
represented (as closely as possible) by « vectors is 

^ = (fi O = - /,) (f„ - A)) 
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An optimal value for A is ft =? Let 7 - f f and g ft ? 
Then the expression to be minimized 

IS always greater than or equal to 
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problem defined on the matrix instead of the matrix X Thus it is 
to find a matnx V = (v, Vp) such that 

I - (v^xY) 


IS a minimum subject to the same three conditions as the primal 
problem Here the x’sarelherowsofA'fwhercas the ^’sare the columns) 
It is obvious that from any statement about the primal problem one 
can deduce a statement about the dual problem, simply by substituting 
x's for f’s and wi’s for n's Algebraically, this is completely trivial, for 
m this sense there is nothing about a matrix X which tells us that the 
columns correspond to elements and the rows to variables, rather than 
the other way about However, statistically the two problems have 
quite different meanings, indeed, there is a problem — what does the 
dual problem mean*^ 

The classification problem is to find a partition P of the set {f } of 
columns of X into p subsets A such that 

Z I (f - 

Am P (in A 

IS a minimum Here represents the mean vector of the vectors f which 
are m the set A, so that 


where is the number m the set A 
The dual classification problem is to find a p partition of the set {x} 
of rows of X such that 


Z Z (■>' - XiY 


IS a nummura, where, if m, is the number of rows in the set A, 

= z ^ 

xinA 

The same considerations apply to this pair of dual problems The 
primal problem is that of classification, described in Section 1 , and has 
^ mterpretation But what is the meaning of its dual 

problem What statistical meaning, for example, has x^’’ 

eouwiilf f’l J '•«'-'«ed component problem is 

equivalent to the primal classification problem Hence, by duality, the 
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dual classifralion problem is equivalent to the primal restncled com 
ponent problem 

Thus consider the dual reslncted component problem Since 
IS a constant the problem is to maximize 


Now the conditions on V are that the set of permissible K 5 is in 
one to one correspondence with the set of ad p partitions P of the set 
of columns of A', the columns > of Kcomspond to the sets m the 
corresponding partition /'msuchawayChatthci sayv^ corresponding 
to A IS defined by 

1 

»4 * ± “Jl 


where is the number in (he set A and is a vKtor having a one m 
the yih position if (j is m A, and a zero otherwise 
Hence the problem is to maximize 


2 

AinP 


I 

in / 




over all /T-partitions P of {{) Since 



this IS to maximize over P 
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Next 
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into two terms as follows 


which IS a constant decomposes 
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The dual classification problem is to find a p-partition of the set {xr} 
of rows of X such that 
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IS a minimum, where, if is the number of rows m the set A, 
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X 
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The same considerations apply to this pair of dual problems The 
prunaJ problem is that of classification, described in Section 1 , and has 
a straightforward interpretation But what is the meaning of its dual 
problem'^ What statistical meaning for example, has 
Actually, as I shall show, the dual restricted component problem is 
equivalent to the pnmal classification problem Hence, by duality, the 
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= s s[«-f.)+Lr 

= z s {(-LY + I.'’aL^ 

A i in A A 

the cross-product term vanishing since 

Z U-L) = 0 
f in y< 

Hence, the problem is to minimize over P 

I S « - hY 

A in P (in A 


which IS merely the classification problem 

This IS proof of our assertions Hence, instead of four problems there 
are tv.o — restricted component analysis and classification These are 
easily mterpreiable The discussion may now be summarized and illus* 
trated geometrically 

Restneted component analysis might also be called dual classification 
or elassificauon of lariables Geometrically, it means that the gi'cn 
raatnx X may be visualized either as n points m m dimensions or as m 
points in n dimensions For example, the malnx 



(I) 

(2) 

[11 

r I 

-I 

P) 


3 

[3] 

L 2 

-2 


giving the results of observing the values of three variables for each of 
two elements and measuring the variables m units of deviation from 


their means, may be represented geometrically either as m Figure 3 or 
as in Figure 4 

The representation of Figure 3 is the same as that of Figures 1 and 2 
The classification problem using this representation is illustrated in 
Figure 2- The dual classification problem, on the other hand, is the same 
geometncally as the classification problem if we use, instead, the rep- 
resentation of Figure 4 It is to group together points in space represent- 
ing variables (instead of elements) so as to minimize the sum of squared 
distances from the points to the means of their groups, therefore, 
obviously, in the problem of Figure 4 the optimal classification into 
two groups would group together vanables (1] and [3] 
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It IS interesting that this process actualj> minimizes the sum of 
squared distances of points from certain orthogonal axes in the space 
illustrated m Figure 3 These axes pass through the origin, there is one 
axis for each group of variables, the direction cosines of an axis are 
equal and nonzero in the variables belonging to its group The axes 
found by choosing the classification of variables 

m 13 )}. {[ 2 ]} 

are shown by dotted lines m Figure 3 
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Ecological Factors in 
Metropolitan Zones and Sectors* 

PJtOBLEif 

The USB of factor analysis as a method for discovering and generaliz* 
mg the fundamental dimensions along which ecological areas are 
differentiated has become commonplace Depending on the investigator’s 
focus of interest, nations (Cattell, 1949), counties {Jonassen and Peres, 
1960), Cities (Hadden and Borgatia. 1965, Oitius J965), communes 
(Piepponen, 1962. Janson. 1965) have been studied in this way But 
probably a majority of ecological factor analyses have been concerned 
With the internal difTcrentiation of cities and have used more or less 
conventional demographic measures for census tracts or statistical areas 
Within single cities or mctropoJiian commomiies as input for the 
analysis (e g , Anderson and Bean, 1961, Bell, 1955, GrCnholm, 1960, 
Janson, 1966, Pedersen, 1965, Schmid and Tagashira, 1964, F Sweet- 
ser, l965o. 19656, 1965c, 1966) In applications to data for statistical 
areas within cities and metropolitan communities, factor analyses re- 
duces the complexity of large correlation matrices for dozens of vari- 
ables to a relatively few factore, which are then interpreted as the 
fundamental dimensions of the community’s ecological structure Work 
to date shows that the method docs indeed produce useful summary 
descriptions of the ecological siniclurcs of individual communities, 

• TbJS research has been supponed by the Gradiiaie School Bosion University, 
and by a grant from the American Phitosoptncal Society Cotnptitanoni uerc done 
m part at the Compuiatton Cenier. Atassacbuseiis Insiiiuie of Technotogy, and m 
pari at Ihe Computation Center, Bosion University 
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With the few fundamental dimensions identified and related concisely 
to the numerous variables analyzed Moreover, these studies offer the 
prospect that through comparative analysis clusters of factors found 
typically in one set of communities which differ systematically from 
clusters of factors found typically m another and distinct set of com- 
munities may be discovered Thus a typology of commun ties, clas- 
sified according to their ecological structure, ma> be developed The 
development of such a typology for metropolitan communities m 
urban industrial nations is the long range objective of the writer’s 
continuing research on aspects of metropolitan ecological structure 
But before this goal can be reached, a number of procedural and tech- 
nical problems must be solved (F Swcelscr, 1964, Janson, 1964) The 
present paper addresses itself to one such problem, the question of the 
relative stability of ecological factors The approach taken is to view 
the metropolitan community first as a whole and then as divided into 
a number of zones and sectors and to perform factor analyses of a 
selected list of variables separately for the total metropolis, each of 
several zones, and each of several sectors The pragmatic assumption 
is made that when factors from differcnl total, zonal, and sector matrices 
show marked similarity in their profiles of factor coefiicients, a stable 
dimension of ecological structure has been discovered On the other 
hand, when similarity is less marked or is limited to only a few factor 
coefficients including the coefficients for the “defining variables” in 
terms of which the factor is interpreted, a condition of instability is 
assumed to exist Both similarities and difierences are of sociological 
interest, fust, as revealing much about the patterns of residential area 
differentiation in different geographic sections of a particular metropolis 
and second, as providing a basis, through comparison of different 
metropolitan communities, for broader generalizations about types of 
ecological factors Based on data from the writer’s ongoing studies of 
the social ecology of metropolitan Boston and of the Helsinki metro 
politan region, selected interzonal, intcrsector, and cross national com- 
parisons of factor profiles will be described as a basis for some tentative 
generalizations about types of ecological factors 

DESIGN 

For Boston, thirty-four conventional ecological variables were com- 
puted from published and unpublished 1960 Census data for 441 census 
tracts in metropohtal Boston These variables are listed in Tables 1, 3, 
and 5 and are defined in Appendix A “Metropolitan Boston” consists 
of the Boston and Brockton Traded Areas as defined by the Census 
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added to the Boston trade arc Xhe total metropolitan 

from the analysis (F Sneetser, 196» P 2M) 

community was disided into ™ j j„j a„d clarity of ecological 

analysis.nsinghous,ng y^iPOP pp , 54 -, 55 ) 

barriers as criteria of t'™””'' ' „ , par the purposes of the 
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Total Metropolis 
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sectors 
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sectors 
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sector y 
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Figure 3 

Zones and Sectors in Ctiy of Helsinki 

of 121 statistical areas including agglomerations — division into zone 
and sectors was made so as to provide a reasonably large number c 
cases m each zone and sector while doing as little violence as possibl 
to known demographic and soaal differences between areas Even sc 
compelling facts of geography and population distnbution made i 
impossible to retain an adequate number of cases m the cast secto 
where there arc fewer statistical areas than there are variables, as show 
in the following tabic. 
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Zones 

inner 

middle 

outer 

Sectors 

west 

north 

east 

Total Metropolis 


Map Designation Number of Statistical Areas 


zone 1 

4n 

zone 2 

40 y total I2I 

zone 3 

40j 

sector 1 

45^ total 110(11 areas 

sector 2 

46 y of zone 1 not in 

sectors 

19 J any sector) 


12] 


Data for the cast sector are nevertheless included jn Tables 2 and 4 
since the factor profiles involved show reasonable consistency in spite 
of the fewness of cases m the analysis 
Using these zone and sector divisions eighteen different factor 
analyses were earned out for Boston total metropolis three zones 
seven sectors for Helsinki Region total metropolis three zones three 
sectors Each was a principal component $ analysis « ith 1 s placed in Ihe 
diagonals and with varimax orthogonal rotations of appropriate num 
bers of factors — 7 for Boston matrices and 6 for the Helsinki matrices 
The decision on numbers of Ihetors to rotate nas based on the modal 
number of factors having latent roots of unity or above in the several 
matnees for each community ThetallyisasshownmthetaWe Forboth 
communities rotated factors accounted for 80 per cent or more of the 
total variance m each matrix 


Number of factors in Matrix Hith Latent Root at J 0 or Above 
ffe/ankt 


Boston 

6 

7 

7 

7 

7 

7 


Matrix 
fetal 
zone 1 
zone 2 
zone 3 
sector 1 
sector 2 
sectors 
sector 4 
sectors 
sector 6 
sector? 


fROCEDVRE 

No effort is made m this paper to analyze ejibaustively the total of 
119 thirty three or thirty four vanable factor profiles produced by 
this design Instead certain factors of interest from the point of vieiv 

of stability inslabilily are selected for comparison and analysis A 
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previous comparison of ecolo^cal factors in metropolitan Boston and 
the City of Helsinki showed that three types of factors were present in 
both communities socioeconomic status factors, famihsm factors, and 
factors of residential status (F Swcctscr, 1966) The residential status 
factors were ill defined because of the absence of data on migration in 
Helsinki and on land use in Boston Moreover, the variable list used 
in the present Helsinki Region analysis is more defective in this respect 
than the list of 42 vanables previously used for the city itself, informa- 
tion available for boih city statistical areas and the metropolitan 
agglomerations is somewhat limited However, the other two types of 
factors found m both aties arc thought to be iim\ersal m the sense that 
they may be expected to appear as fundamental dimensions of ecological 
structure in all metropolitan communities Socioeconomic status factors 
and familism factors in zones and sectors of both communities are 
therefore examined (Tables 1, 2, 3, and 4) 

The previous comparison also showed that there were a number of 
special factors, deriving from the particular sociocultural character of 
their respective national settings, in both Boston and Helsinki These 
special factors included a number of ethnic factors (nonwhite, Insh 
middle class, Italian) in Boston and a factor called “career women” in 
Helsinki Differences m ethnic homogeneity and in female labor force 
partiapation between Finland and the United States m general, and 
between Helsinki and Boston m particular, are thought to account for 
these differences The special factors are usually less potent than the 
universal factors, as measured by their vanance, and are also less 
consistently defined m the vanous zones and sectors In order to 
illustrate these ditferences, the Insh middle-class factor as it appears 
in some of Boston’s zones and sectors is examined (Table 5), and the 
career women factor, together with a “Swedish language" factor and 
a. “cucafisra." facUit: that. vw wmt ^ He,Vsw,Va TraWYies zta 

presented in Table 6 

To summarize, in this paper wc are concerned with the interzone, 
mtersector, and cross-national stability of the following factors 


Universal factors 
Socioeconomic status 

SES in metropolis, zones, and sectors of Boston Table I 

SES in metropolis, zones, and sectors of Helsinki Table 2 

Famihsm 

Famihsm m metropolis, zones, and sectors of Boston Tabic 3 

Progennure (young famihsm) in metropolis, zones, and 

secton of Helsinki Table 4 
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Special factors 

rmhmMt ehu m mclropolB. ima. anii scclors of Bos'on Tabk 5 
Suedish lansuagc ^ J 

farcer ufl^ww ^aslheyappearmmelropolis.zones.and 
Kura/ism J sectors of Hclsmki Table 6 


FIND/mS^UNiyE/tSAL FACTORS 
Socioeconomic Slams Factors in Metropolitan Boston 
Table 1 consists of factor profiles of SES factors one from each of 
the eleven Boston matrices The table includes, in addition to factor 
coeJTicients for the 34 variables, the variance of each rotated factor (sum 
of the squared factor coefficients), the rank of the factor’s variance in 
Its own matrix, and the coefficient of congruence measuring the degree 
of similarity of each zone and sector SES factor to the metropolitan 
SES factor (Harman, I960, pp 256-259) In the body oftbe table, factor 
coefficients of 45 or above arc enclosed m parentheses to indicate that 
these coefficients are large enough to merit special attention m inter- 
preting the factor, since (hey mean that the factor accounts for at least 
one fifth of the variable’s variance In addition, italicizing and capitali- 
zation of variable names is cmplojcd, as explained in the footnotes to 
the table, to emphasize variables that load strongly and consistently on 
the several SES factors 

Examination of Table I shows that seven vanabfes have factor co- 
efTicients of 45 or higher with appropnate signs — positive for high 
status indicators and negative for low status indicators —■ in ten of the 
eleven matrices 
20 Room crowding 
21. Housing defects 

22 Median family income 

23 Low family income 

30 Male professional and managerial occupations 

33 White-collar occupaltons 

34 8 grades school plus 

These seven variables include Ibc conventional objective measures of 
socioeconomic status — housing income, occupation, and education — 
and may be thought of as the delining vanables for the SES dimension 
in Boston In addition, it will be noted that the coelBcients usually 
exceed .70, which means that the factor accounts for more than 30 i«r 
cent of the variable's variance and that the variable's loading on the 
SES factor is the highest in the malna in question An additional four 
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variables have factor coefTicicnts of 30 or above (with appropriate 
signs) m ten or eleven of the eleven matnees 
25 Walk to work 

27 Males unemployed 

28 Males “at leisure” 

32 College graduates 

While not of so fundamental a character as the seven variables listed 
earlier, they solidly confirm the interpretation of the SES factors With 
eleven of 34 variables so strongly loaded on these SES factors, the 
strength and importance of the SES factor is already clearly indicated 
It IS reinforced by the large factor variance of the SES factors in their 
respective matrices in all but the inner zone (sec Table 1, bottom) and 
by the fact that the SES factor ranks first in variance m seven, and 
second in variance in three, of the eleven matrices 

Coefiicicnls of congruence reinforce the impression of similarity 
among the factors of Table I In nine of ten comparisons, zone and 
sector SES factors are “very similar” to, or “congruent” with, the 
metropolitan SES factor (coefiicicnts of congruence « 801 to 972) 
Again, the one exception is the SES factor m the inner zone, which is 
only “moderately similar” to the metropolitan SES factor 
A closer examination of the SES factor in the inner zone as shown in 
Table 1 will reveal that m addition to having conspicuously lower 
variance than the others in the table (ranking only third in importance 
m Its matrix), this factor usually has decidedly lower factor coefficients 
for SES-defining vanables than the others The reason is not far to 
seek “ there are, m fact, /«o SES faclore in the inner zones One 
of these is bipolar with famihsm Unitary in the wider metropolis, 
the SES factor divides into two factors m the urban core ’ (F Sweetser, 
1965c, p 9 ) The evidence for this conclusion is best seen by companng 
directly the factor coefficients for SES-dcfining vanables as they appear 
in the SES factor for the inner zone (Table I) with the factor coefficients 
for the same vanables in the familism factor in the inner zone (Table 3) 


SES Defining VarlabUf 

20 Room crov. ding 

21 Housing defects 

22 Median family income 

23 Low family income 
30 Male prof mgr occs 

33 Whitc-coflar occs 

34 8 grades sch plus 


Inner Zone Factor Coefficients 


SES Factor 

Famihsm (SES) Factor 

- 36 

(77) 

(- 56) 

35 

33 

(- 47) 

(- 49) 

18 

13 

(- 86) 

11 

(- 83) 

34 

(- 68) 
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So far as these set en variables are concerned, it is evident that the inner. 
Mite fainilism factor- ,f refltcterl— is more strongly a measure of 
bES than IS the SES factor of Table I This is particularly notable in the 
loadings for the three occupational and educational variables which 
are very high for the familism (SES) factor of Table 3 but low or 
negligible for the SES factor of Tabic I It would appear from these 
data that, in the inner city, socioeconomic status as related especially 
to occupation and education is negatively associated with familism 
(child-reanng) but that there is a second dimemion of SES also operat- 
ing, principally m terms of housing defects and low family income, 
which is independent of familism. as the zero and near-zero loadings 
for variables ] (age6-13ycars)and 10 (fertility ratio) on the inner-zone 
SES factor of Table I show 


As between zones and sectors, it may be noted m Table ! (hat none 
of the ethnic variables (numbers 4-8) have factor coefficients of 45 or 
above on the metropolitan or zonal SES factors but that there are 
Several such loadings on sector SES factors. Apparently, for the total 
metropolis and withm zones, there are no strong correlations between 
measured aspects of ethnicity and the SES factor, but, within certain 
sectors, patterns of ethnic distribution are such as definitely to relate 
the ethnic character of residential areas to their socio-economic status 


Again, It may be noted, as shown by factor coefficients for variables 
1 1 and 12 m Table I. that home ownership and detached dwellings are 
more strongly associated with the status dimension m the total metrop- 
olis and in most of the sectors than m the three zones The relationship 
IS particularly weak tn the inner zones These differences arc, no doubt, 
a direct consequence of the typical gradient in dwelling types from 
multiple dwellings m the core to detached houses m the suburbs zones 
tend toward a degree of similarity in housing types, but the metropolis 
and the sectors show marked contrasts 


S/a/us Factors in the Helsinki Region 
As m Boston, Die SES factors in the mclropote and in the zones and 
sectors of the Helsinkr Regiotr are consisterrt and typically fairly sironj 
As (S showrr in the summary data at the bottom of Table 2, hretor 
variances range from moderate to large. and the SES factors rank first 
or second in importance in five of the seven matrices Congraence of 
zone and sector SES factors rvilb the melropoklan SES factor range 
from “markedly similar" to “congnienf ’ ( 799 to 961) And the eight 
defining variables — for HelsinU variables with factor coefficients of 
.45 or Ibove in six or seen of the seven matnees _ measure expected 
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aspects of housing occupation and education (no income data were 
available for Helsinki statistical areas) 

14 Room crowding 

1 5 Living space 

19 Dependence on industry 

20 Dependence on services 

22 Blue collar occupations 

23 9 grades school plus 
30 Small dwellings 

33 Housing defect-plumbing 

One additional vanable — housing defect-heat — loads at 30 or 
above m six of the seven matrices As in Boston, a majority of the 
coefficients for defining vanables are above 70 this shows that half or 
more of the variable's variance in the matrix is accounted for by the 
SES factor Unlike Boston, however, none of the geographic subdivi' 
sions show a differentiation of the SES dimension into two distinct 
factors 

Helsinki SES factors m all three zones and in the east sector have 
loadings of 30 or higher for the ethnic variable (No 8, Swedish 
language) though not in the metropolis or the west and north sectors 
The loadings arc especially large m the inner and outer zones (at 8! 
and 54, respectively) and arise from the marked concentration of 
Swedish speaking Finns in certain of the higher status statistical areas 
of the City of Helsinki and from persistence of prevailingly Swedish 
speaking agglomerations — often of higher status — m the outlying 
portions of the Helsinki Region Generally, ethnic homogeneity char 
acterizes Finland and Helsinki, m contrast to the typical ethnic hetero 
geneity of Boston and most other Amcncan metropolitan communities 
In the City of Helsinki itself, as is shown in F Sweetser (1963 and 
19656) there is no distinct ethnic (Swedish language) factor, although 
the assoaation of Swedish speaking Finns with higher status areas is 
evident Weak Swedish language factors (discussed later) appear in 
only two of the seven matrices for the Helsinki Region This contrasts 
sharply with the three ethnic factors found repeatedly m Boston Irish 
middle class (discussed later) nonwhite andItahan(F Sweetser, 1965c) 
Tabic 2 also shows a slight tendancy toward a negative association of 
familism and SES m both the west and the east sectors as the negative 
signs and coefficients above 30 for variables 2 (age 0-4 years) 3 
(age 5-14 years) 9 (married) and 10 (fertility) indicate, but none of 
these factors shows the clear bipolanty of SES and familism noted in 
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on age vanables (vanables 1, 3. and 3),and even vanable 10, the fertility 
ratio, might be thought of as essentially a measure of age structure 
Second, the failure of vanable 2 to load at meaningful levels in four 
matrices has the effect of converting the general “familism” of the 
metropolis and the other 2 ones and sectors into a special kind of 
famihsm, which might be labeled “young famtlism” or “progeniture ” 
But m the City of Helsinki an earlier analysis showed that there (child 
centered) famihsm typically differentiated into two factors, interpreted 
as “progeniture” and as ‘established famihsm” {F Sweetser, 1965^7) 
This difiercntiation of famihsm into distinct factors persists m the 
Helsinki Region, as indicated later and in Table 4 

Progeniture (Young Famihsm) Factors m the Helsinki Region 
The earlier (42 vanable) analysts of the City of Helsinki produced, 
not one, but three famihsm factors, desenbed as “progeniture,” “es- 
tablished famihsm ” and “postgcniturc * (F Sweetser, l96Sb, 1966) 
The same three famihsm factors also emerge in the set of seven matrices 
for the Helsinki Region Taken together, these Helsinki famihsm factors 
are clearly dominant in each of the soen matrices understudy However, 
they are sometimes differenliaied into distinct factors and sometimes 
merged with other factors, so that their complete elucidation would 
present a far more complex picture than can be presented here For- 
tunately, progeniture emerges consistently m all sesen matnees, as the 
fartor profiles m Table 4 for Helsinki metropolis, zones, and sectors 
show It IS a distinct factor in six matrices and merges with the other 
ttto aspects of familism into a single factor m the seventh This merged 
factor, in the inner zone, consequently takes on the character of a 
general famihsm factor of the sort more typical in Boston 
As with Boston’s famihsm factors, the four defining vanables for 
progeniture in Helsinki (variables with appropnately signed loadings 
of 45 or above in six or seven of the seven matnees of Table 4) accent 
age structure 

2. Age 0-4 years 

4 Preadolcscenl ratio (proportion 0-4 among those 0-14 years of 
age) 

5 Middle age ratio (proportion 20-39 among those 20-64 years of 
age) 

10 Fertility (ratio of children 0-4 to men age 20-49) 

Three additional variables load on progeniture with appropnate signs 
at ^0 or higher mat least fiveoftheseven matnees median age(variable 
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these distinct ethnic factors sometimes merge into a single — usu^y 
bipolar — factor and sometimes fad to appear in a given matrix 1 ne 
over-all pattern of ethnic factors m the eleven metropolitan zone and 
sector matnces is too complex for analysis here However, the factor 
profiles for the Irish middle-class factor as it appears in eight of the 
matrices w ill illustrate the ways in which special factors may vary among 

zones and sectors ^ ^ c ti 

First a look at the summarizing data at the bottom of Tabic o win 
reveal that the eight Irish middle-class factors all have relatively low 
variances and rank from fourth to sixth m potency in their respective 
matnces, thus contrasting sharpl> in strength with SES and famihsm 
factors in Tables 1 and 3 These are evidently minor factors, as well as 
special factors The congruence of the zone and sector Irish middle-class 
factors with the metropolitan Irish middle-class factors shows marked 
similarity for five matnces, but the factors in the inner zones and in the 
northeast sector are only moderatel) similar to the metropolitan factor 
With such low factor vanance, it is to be expected that the number of 
defining vanables will be few Indeed, there are only two 

6 Insh stock (loadingat 45orabovem8of 

8 matrices) 

31 Male clerical service employment (loadingat 58 or above in 6 of 
8 matrices, and at 38 or above 
m7 of 8 matnces 

In addition, variable 29, employment m manufacturing, loads (nega- 
tively) at values above — 45 on three of the Irish middle-class factors 
Where these negative loadings occur, of course, they reinforce the 
“middle-class ' aspect of the factor 

Four of the factors shown in Table 5 arc unambiguously Irish and 
middle-class, as shown by the facts that (1) both defining vanables have 
higher loadings on this factor than on any other in their respective 
matrices, and (2) the coefficients for the two defining vanables are the 
highest in the profile for the factor These unambiguous Irish middle- 
class factors occur in the metropolis, the middle and outer zones and 
the northwest sector 

Two of the factors of Table 5 seem Irish enough but lack a clearly 
defined middle-class aspect In the northeast sector and the west sector, 
factor coefficients for the Insh stock variable are higher than for any 
other factor in the respective matnces, but coefficients for the second 
defining variable, male clerical and service occupations, are rather low, 
at 22 and 38, respcctiv ely In the west scaor, inspection of the factor 
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Evidently, this is a bipolar factor, which might be described as “non- 
whitc/forcign stock ’ because of the high positive loading for the non- 
white variable and the high negative loadings for foreign parentage, 
Irish stock, and Italian stock 

In the inner zone an analogous but more complex situation exists 
Here, the Irish middle class factor of Tabic 5 accounts for less of the 
variance of the Irish stock variable than either of two other factors 
The character of these additional ethnic factors is show n by the follow- 
ing data on ethnic variables in this matrix 

Variables Factor Coeffictenis (Reflected to Appropriate signs) 


4 Foreign bom 

Irish Middle 
Class 

14 

Factor X 
(87) 

Factor Y 
- 07 

5 Foreign parentage 

- 02 

(66) 

(- 57) 

6 Irish stock 

(45) 

(- 50) 

(- 48) 

7 Italian stock 

- 30 

(80) 

- 18 

8 Nonwhite 

03 

- 21 

(86) 

Factor variance 

266 

3 80 

3 08 


It IS quite evident that factor A' is a bipolar ‘ Italian/Irish" factor 
Identified as Italian by the high loading for Italian stock ( 80), its 
bipolarity with Irish ethnicity is brought out by the substantial factor 
coefficient of— 50 for Irish stock Similarly, factor Y is clearly a 
bipolar nonwhiie/lnsh” factor As in the south sector, inner zone 
Irish ethnicity as an ecological dimension is differentiated — this time 
into three distinct ethnic factors 

The three sectors of metropolitan Boston where no factor that could 
be classified as Irish middle class was found re\eal two different 
patterns of loadings for the Irish stock variable In the southwest sector, 
this variable repeats the pattern of the inner zone and the south 
sector m having its highest factor coelficicnt for the matrix ( — 73) on 
a bipolar ethnic factor that is best interpreted as nonwhite/foreign 
stock” In the north and the southeast sectors, however Irish stock 
loads meaningfully only on the socioeconomic status factor with 
coefficients at — 74and — 76 respectively (see Table 1) In these two 
sectors then ‘ Inshness combines negatively with SES but neither 
emerges as a distinct factor nor merges with another ethnic factor 

Swedish Language Career Women and RuraUsm Factors m the 
Helsinki Region 

The Swedish language factor (Table 6) is especially interesting because, 
as noted before, a prior search for a Swedish ethnic factor in the City of 
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^ * lOASh 19661 Now, in the metro- 

include* 


8 Swcdish-speaking Finns 

14 Room crowding 

15 Living space 


(loading positively) 

(loading negatively) 

(loading positively) 

15 Living space ' 

The factor evidently assoc, at» '"“H”"'” g tZl there « “ 

with more dwelling space " s„,d,sl, language variable loads 

Swedish language factor, inste • 
most highly on a variety of other factor 


Matrix 
Inner zone 
Middle zone 
Outer zone 
North sector 
East sector 


e' ” °"^,“a'Ss «h ‘none as high as W 

(dispersal :ymetSvSor"hat combines career 
“'women and postgemtore 


tasisvx"" women and posig-:--- 

ftartnr is the weakest in the 
the Swedish language 'acio' * (bottom) 

Even where It app a» th ^ „ Table e 

mains, wtlh very ibe Helsinki Region, langu S 

„ „ evident tlial m ' “ „f meltopohlan ccologieal dih 

more than a>ery min 

tiation e,f the Helsinki ' “,srTihle6 Ezami- 

Cnreeriiuttici. facto » 

£3=ts:-— 

29 Detached dwellings 


29 D°elch=d dwellings ^ p„f.les 

Two more variables arc at 32 or above 

2S Apartment bouse nac (toadmg negaUvCy) 

32 Housing defect 
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k Room crow-ding’ (—55) 

). Lising space (46) 

) Home ownership —15 

I. Women m labor force —34 
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In the east sector, a factor merging features of this career women 
profile with features of postgeniturc (not shown in Table 6) is the most 
potent in the matnx and has a factor variance of 9 84 In the west 
sector, the career women configuration is almost completely merged 
with the SES factor shown m Tabic 2, as inspection of the high factor 
coefficients for the above five variables shows It is notable that the merg- 
ing IS a “positive ’ one in the sense that higher socioeconomic status 
and higher levels for career women are combined This is consonant 
with the onginal identification of the factor as it appeared m the 42- 
variable matrix for the Cil> of Helsinki “Not only working women 
but working women in the higher- rather than the lower-status white 
collar occupations mark the factor” (F Sweetser, 19656) Although 
detailed information on women’s occupational status is lacking in the 
regional variable list, this positive merging of career women and SES 
IS evidence that the original characterization can stand 
It will be noted that coefficients of congruence of the inner and 
middle zone career women factors at the bottom of Table 6 indicate at 
least marked similarity with the metropolitan factor and that these 
factors are of moderate strength ranking third m factor variance m 
their respective matrices It will be noted, too, that the metropolitan 
factor IS to a degree a “merged ’ factor also, it includes appropriately 
signed high factor coefficients associated with postgeniturc in variables 
3 (age 5-14) and 13 (size of household) And, together with the career 
women factor in the middle zone, it has high loadings on the defining 
variables of ruralism (to be discussed next), though with opposite signs 
Two factors interpreted as niralisni are shown in Table 6 for the outer 
zone and the north sector The> derive their meaning essentially from 
high factor coefficients for three vanables 

18 Dependence on agnculiure (positive) 

21 Emplo>ers (positive) 

25 Loganihm of population density (negative) 

As just noted, the career women factors m the metropolis and in the 
middle zones have high factor coeffiaents for these same three vanables, 
although with opposite signs It is not surprising therefore, that the two 
ruralism factors of Table 6, when tested for congruence against the 
metropolitan career women factor turn out to be moderatel> similar 
similar on the basis of reflected signs (indicated b> the negative 
values for the coefficient of congruence m Table 6) It is notable, how- 
ever, that loadings of variable 17, women in the labor force, are not 



ecological factors in meiropolitaji zones 


445 


ECOLOGICAL FACTORS 

^i.c« fnrtnrs of Table 6, altbough they 
remarkably high for the U\o ru 23 and - 41, respcctuely) 

are negative, as would be expected (- 23 and 

discussion AND CONCLUSIONS 

Stablhl) of Ecolosical Fmlots metropoWan 

It had been anticipal':'! “^'"“factors denvcd from analysis 

ecological differentiation as HeUmW, ivonld prove 

of metropolis, zones, and seao the case, 

to be more stable than the ^ stability to ."fabHitl 

although the data reveal “ sES faetois are consisten y 

factors, rather than aclean ^arlanccs m nearly 

potent, as measured by the si consistent y 

all of the eighteen matrices ^sand sectors of both metrop- 

IS a highly stable fa 

comparisons . of famihsm. 1'°"'''' • revealing a 

The universal dimension^ ^ p rerful facto 

Boston it IS manifest ,n the outer z e fam.lism, 

generally i;®'"^'Xctr« “ “ the a-l” 

south, and southeast _,ngful loadings fo ^niy one strong 

„rprogeniture,vvdhol™e“mJ^^^^_^,^^„^^^^^^^^^^^ 

(see Table 3, variable 2) the zone) ‘ ^re cither 

general fo"''"™ ^'ramiHsib. and P'-^^^rged W'th other faetors 
progemture, * jr^met factors or are m g rivalmg 

fully differentiated ml r 

Colleetively, these He “nme 

SES m strength in a ,he P™®'"' escept in the inner 

weak, minor factors ^ gj,.,„r eve > „re across zones 

The special, cultur y 



446 


FRANK L SWEETSER 


structure exhibit quite varying degrees of stability In part, this is due 
to the fact that the special factors are often less potent than the universal 
factors and hence more subject to random statistical variation But it is 
also to be attributed to the interactive relations of relevant variables in 
particular community settings For example, the analysis of the Irish 
middle-class factors in Boston's zones and sectors showed that the 
ethnic dimension in that metropolis behaved statistically m much the 
same v-ay as the familism dimension in Helsinki, producing three 
distinct ethnic factors (nonwhite, Italian, and Irish) in some matrices, 
which failed to appear, or merged in varying combinations, in other 
matrices Yet strength was moderate and congruence with the metro- 
politan factor was often marked or close for zone and sector Irish 
middle-class factors The special factors of career women and ruralism 
m Helsinki show similar permutations, and the weak Swedish language 
factor appeared in only two matrices On the whole, the data support 
the generalization that these special factors are relatively unstable m 
mterzone, intcrsector, and cross national comparisons 


Technical (Formal) Classificatton of Metropolitan Ecological Factors 
In previous papers, the writer has placed much emphasis on the in- 
variance of factors in the domain of social ecology (F Sweetser, 1965fl, 
19656) But factorial invariance has two aspects (1) in relation to the 
emergence of identical (congruent) factors from matrices based on 
difrercnt sets of variables for the same communities at the same point 
in time, (2) in relation to the emergence of identical (congruent) factors 
from matrices for cither cross sectional comparisons of different com- 
munities or longitudinal comparisons of the same community at different 
points in time The first type of invariance has been tested in several 
ways, and seems Vicll validated by the work of Schmid and Tagashira 
(1964) and F SN^eetser (1965a, 19656) It is given further support by 
the dose Similarity of the factors denved for Helsinki from the present 
33 variable analysis and those previously reported based on a 42- 
vanablc analysis (F Sweetser, 1966) 

The second type of invariance," however, is really the quality of 
stability-instability we have just been examining Clearly, only for the 
universal SES factor can such invariance be claimed But the existence 
of a continuum of stability-instability among ecological factors does 
offer an empirical basis for the technical or formal classification of such 
factors which should also be a useful guide in their further analysis 
Such a typology is suggested m the accompanying table 
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Formal Types 
(1) Unitary factors 


(2) Differentiating 
factors 


(3) Combinatory 
factors 


ECOLOGICAL FACTORS IN SETROPOICAN TONES 
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dimension 

^ (may combine with 


(0 
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Substaniixe Classification of Metropolitan Ecological Factors 
Throughout this paper, ecological factors base been characterized as 
either “universal” or “special ” The universal factors have been con- 
ceived as deriving from necessary societal and cultural features of 
urban-industnal societies and therefore present in alt metropolitan 
communities m all such societies The special factors have been pre 
sented as deriving from the distinctive social structures and normative 
systems of nations, regions, or local areas Essentially, this distinction 
has been applied in cross-sectional comparisons of the ecological 
structures of Boston and Helsinki at a single point in time When 
factors, or types of factors were found to be matched in both com- 
munities, they were called “universal”, when factors or types of factors 
were found to be distinctively different in the two communities, they 
were called ‘ special ” A primary objective was to discover whether, 
as a preliminary contnbution to the development of a typology of 
metropolitan communities based on their ecological structures, the 
analysis of Boston and Helsinki could support prototype descriptions 
of a ‘North American” and a ‘ Scandinavian” type of metropohs 
Previous comparison of the two total metropolises had produced a 
schematic summary of the substantively identified factors and types of 
factors in the two communities (F Sweetser, 1966, Table 5) Now, a 
revision of that schematic listing is offered as a preliminary formulation 
of the two community types m the accompanying tabic 


Tho Types of KietropoUtan Ecological Structures 


Metropolitan Type 


Fundamental Dimension 
{Factor Type) 

^ORTH AMERICAN 

1 

P Socioeconomic status 


Umtersal factors < 

Familism/age structure 
i^Residential/migrationst atus 


Special factors 

Ethnicity 

SCASDmAVlAN 

1 

r Socioeconomic status 


Umtersal factors -i 

Familism/age structure 
l^Residential/migration status 


Special factors 

Career women 


Residcntial/migration status as a type of ecological factor isdiscussed 
indctailmF Sweet5er(I965cand 1966) it has not been treated in this 
paper, as explained before because the available data for the Helsinki 
regional analyses were insufTicicnl to define the factor clearly On the 
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the natural tendency to follow the line of least resistance and utilize 
someone else’s delimitations of metropolitan areas or regions But a 
fully objective comparative study of metropolitan ecological structures, 
whether interregional or cross-national, would appear to require that 
social ecologists develop some means of delimiting metropolitan com- 
munities on a uniform basis 


Appendix A 


Dernitions of 34 Variables Derived 
FROM Census Tract Data, Boston, 1960* 


1 Age 6-13 years 

2 Age 14-17 years 

3 Age 65 years plus 

4 Foreign born 

5 Foreign parentage 

6 Irish stock 

7 Italian stock 

8 Nonwhitc 

9 Sex ratio (single) 

10 Fertility ratio 


Percentage of population age 6-13 years (elemen- 
tary school ages) 

Percentage of population age 14-17 years (high 
school ages) 

Percentage of population age 65 years and over 
(retirement ages) 

Percentage of population foreign bom 

Percentage of population native born, of foreign 
or mixed parentage 

Percentage of population of foreign stock 
(Ireland) 

Percentage of population of foreign stock (Italy) 

Percentage of population nonwhite 

Single males age 14 plus — single females age 14 
plus X 100 

Children under 5 years — females 15-44 years x 
100 


11 Detached dwellings Percentagcofhousingunitsinone unit structures 

12 Home ownership Percentage of occupied housing units occupied 

by owners 

13 Women m labor force Percentage of females age 14 and over in the 

labor force 

14 Working mothers Percentage of married couples with children 

under 6, wife in labor force 

15 Nonfamily Percentage of population in institutions or living 

as unattached individuals 

16 Residential stability Percentage of population age 5 and over in same 

house in 1955 as in I960 

17 Intra-SMSA migration Percentage of population age 5 and over which 

moved within SMSA from 1955-1960*» 
l6 Inter SMSA migration Percentage of population age 5 and over which 
^ moved into SMSA from 1955-I960*> 

19 New housing Percentage of housing units built from 1950 to 

1960 



ECOLOO.CAL FACtORS 11 MCIHOPOLITAN ZONE 


453 


pcicentasc of occupid “""s mlh 1 01 

iJcr'tal^trrpTumbmg o, 

ii. iivuoi..£, central heal 

22 Mediaa mcomeurdar 

ing to work by auto 


20 Room crowding 
21. Housing defects 


2Z Meuiaii •• — 

23 Low family income 

24 Auto to work 
25. Walk to work 
26 Commute 

27. Males unemployed 

28 Males “at leisure" 

29 Employment in mfg 




perceniagvu* — ' 

Pernll""" 

29 Employmwt in mfg „jlM in profoM™! 

30 Mg.pior..gr- 

31 Male cler-serv OKI j„d service occupanons j„j older 

pereenteg. »' oVscheol 

• Sec also F Sweeiser, I^ton Reu« 

• These "standard scores 

pp 54, 55 


32 College graduates 

33. White-collar o«s 
34 8 grades sch plus 


Appendix B 


Statistical Areas a 


Media" ^j'oJuWio" age M 1 


. Median age Porcenwge i”;;-',;„on age 5-H . „ 

;, Age (M yea^ Penxn.ai" "fSien under 15 >ram •>'» "" 
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5 Middic-age ratio Percentage of population 20-64 years who arc 

20-39 years old 

6 Age 65 years plus Percentage of population age 65 years and over* 

7 Proportion male Percentage of population male 

8 Swedish-speaking Percentage of population speaking Swedish as 

Finns principal language* 

9 Mamed Pcrccntagcofpopulation I5yearsandoldcrwbo 

are married 

10 Fertility Number of children 0-4 years per 1,000 males 

20-49 jears*' 

11 Widowed and Percentage of population 15 years and older who 

divorced arc widowed or divorced 

12. Nonfamily Percentage of population 15 years and older m 

one-person households 

13 Size of household Mean number of persons m households with 

two or more persons 

14 Room crowding Number of persons per 100 rooms 

15 Living space Numberofsquarc meters per person indwellings 

16 Home ownership Percentage of dwellings occupied by owners 

(including share-holders) 

17. Women tn labor force Percentage of females age 15 and older who arc 
economically active (excluding those unem- 
ployed and seeking work) 

18 Dependence on Proportion of total population economically de- 

agriculture pendent on agriculture (deciles) 

19 Dependence on Proportion of total population economically de* 

industry pendent on manufacturing industry (deciles) 

20 Dependence on Proportion of total population economically de- 

serviccs pendent on services, including transportation, 

commerce, etc (deciles) 

21 Employers Percentage of economically active who are em- 

ployers of labor 

22 Blue<ol]ar Percentage of economically active who arc man- 

occupations ual workers 

23 9 grades school plus Percentage of population age 15 and older who 

have passed middle school or student examina- 
tions 

24 Female higher Number of females who have passed student 

education examination (for university admission) per 100 

females who have passed middle school ex- 

amination 

25 Population density Logarithm of number of persons per square 

Oos) kilometer 

26 Public buildings Number of public buildings per 100 residential 

buildings 

27 Industrial buildings Number of industnal buildings per 100 residen- 

tial buildings 

28 Apartment house size Number of dwellings per residential buildmg 
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29 Dclached dwJJifjffs 

30 Small d^climgs 

3 1 New housing 

32 Housing dcfccj-hcai 

33 Housing dcfcct- 
plumbing 


m one or two dwelling 

'tss/LSr ''™ 

Percentactf of dwellings built I95I-I960 
Percentage of dwellings lacking centra! heal 
Perccniagtf of dwelltngs Jacking flush toilet 
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Consortium could not base its wort on published tables; thus a great ejort oJ ata 
recover) had to be organized In each state by local historians. This is only one 
away from ifie ultimate stage oJ data organization in historical ecology : the esta 
hsbment oJ indicators oJ community struaure through the processing of origina 
data in registers and other standardized bookkeeping records. This can be donejor 
early Wfrtioni. John i inccnt’s \*ork on the I’oU BooU of Vicrorian England an 
Erik Hogh's analysis of Danish electoral registers during the period of open toting 
from 1849 to 1901 are outstanding examples. The procedure can also be use 
in reanalyzing early censuses: the original protocols ore tranferred to cords or 
tape to make fresh tabulations possible. This is essentially the strategy pursue 
by £inincinu</ Leroy-Ladurie and his team at the Centre de Kecherches His 
toriques in Pans; they are engaged In the computerization of the records 
of the French military orchnes and will, once the data have been organized, be able 
to study processes of change in traditional French society not only at the indindual 
level but also at the level of the commune. The great French school of historical 
demographers have gone even further in the generation of data far the study of early 
developments. L. Heniy and his colleagues base worked out techniques for the recon- 
struction of tables of fertility and mortality for the centuj^ before the first regular 
ceniuj, thi^ have established samples of porijhcr and have tried to derne census- 
type information from the registers of christenings and burials. Laslett and his 
group at Cambridge have tried out a similar procedure for preindustrial England. 
The chapter ot the end ifParC Five offers a fascinating occouni of. this work and 
spells out some of the possibilities of further dcre/opmenr Similar work is under 
way in other countries of Europe. Once such samples have been established, we shall 
tnow much more about the variations in the situations de depart in each region 
and w'lll be able to carry oui much more realistic analyses of the processes of cultural, 
social, and political transjormation brought about through the growth of the 
economy and the strengthening of the centra/ national institutions in each country. 
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and h.s socal microcosm, on 'the onn hand, and broad 
dibons, on tho other Work of this sort is well represented in this 
symposium Yet this obvious example scarcely exhausts the types 
investigations that may profit handsomely from joint exploitation ol 

these data bases , 

Another of these types of inquiry, and one that concerns us priman y 
here, attempts to v.ork from the complementary strengths of both 
types of data to circum\ent their respective ^\eaknesses One of the 
prime attractions of aggregate data is ihcir richness in national variety 
and histoncal depth Such scope gives analytic entree into a range of 
systemic conditions — historical junctures, cultural and developmental 
differences — that survey data obviously cannot now match A prime 
weakness of ecological materials lies in their aggregated, summary 
nature and the intrinsic ambiguities that spring from this fact For any 
pattern of aggregate variation taken alone, it is typical that a number o 
plausible "models” of the underlying process that generated it can be 
imagined, and the aggregate information as traditionally treated pro- 
vides little apparent leverage for a more satisfying diagnosis 
Hov.e\er, a close understanding of what we might call the “in- 
frastructure” of the competing explanations through analyses of survey 
data even from a different time or place often serves to suggest further 
critical tests of the aggregate materials that may help to determine 
which process is the more plausible generator of the aggregate patterns 
Sometimes survey materials will suggest ancillary or modifying variables 
that need to be brought to bear on the aggregate patterns Sometimes 
understanding of the pattern m its disaggregated form suggests more 
incisive \says of organizing the aggregate data from a statistical point of 
view, to sharpen or purify the aggregate effects if indeed they have been 
generated m one way rather than another In the most elegant cases, 
one may have a model or simulation of the suspected processes at an 
individual level If such a model can faithfully reproduce the known 
survey effects while generating comparable gross aggregate patterns, 
one’s power to decode the meaning of earlier aggregate materials is 
greatly enhanced In some instances we may find that parameters im- 
portant to the process as understood for the current period are known 
to have been quite different a century earlier (urbanization, education, 
population age structure, communication processes) Then with a 
satisfactory mode! it is an easy task to reset the appropriate parameters 
and discover whether the model can indeed reproduce the historical 
aggregate information 

Of course, such work can provide feedback in its turn to the design 
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of a logical companion piece for the MacRae-Meldrum paper This 
analysis will be pursued with the aid of illustrative aggregate data from 
the United States and France 

THE BASIC VOTING MODEL 

Although survey research material on voting has been crucial m 
fleshing out the basic model we are employing here, some of the initial 
insights were emphasized by keen students of aggregate voting statistics 
Most notably, V O Key became fascinated with the fact that voting 
returns over long periods of time tended to show long periods of relative 
stability, punctuated at rather rare intervals by large-scale shifts m 
fundamental political coloration of voting districts This was neither a 
trivial nor a self-evident observation The neophyte examining aggregate 
voting records over long periods of time might well not have noticed the 
fact, for, after all, such tabulations present quite a formidable sea of 
change No election is quite like any other election, even for the same 
district, and the party division of the vote is always shifting this way 
and that from election to election What attracted Key’s attention was 
the fact that, although temporal variation in vote outcomes was 
ubiquitous across districts, such variation tended to have a rather clear 
center of gravity for any particular district, and short-term swings m 
actual elections returns could be seen as oscillations around this center 
of gravity In other words, if a district seemed to cast most of its votes 
in the 40-45 per cent Democratic range over a period of time and then 
in a given election swung toward an unusual 55 per cent Democratic, it 
was far more likely to swing back toward the 40-45 per cent range in any 
subsequent election than it was to evolve onward into the 60 per cent 
Democratic range, or even to remain at 55 per cent Occasionally, 
however, the center ofgraviiy of a voting series for a district did appear 
to undergo a step change, with new oscillations beginning around the 
new level of partisanship And most important, such step changes were 
not haphazardly distributed in time for districts but rather tended to 
occur for a large number of districts in the same election and then did 
not occur again for a considerable period of time As we have seen. 
Key labeled these elections “cniical elections" and commented upon 
their occurrence m American history * 

At the same time, survey research data were demonstrating that voters 

*y O Key,Jr.“AThcoTyofCrHKalEkctions.* Journal of Politics. 17 {195S). 
pp I-Io 
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abrdmrail “b <bt paMy In ivh,ch they felt a mote 

abtdmtailesiance Thus foraamplc m tPK General Eisenhoyertvoi. 
me first substantial Republican majority for President that the nation 
a seen m 24 years, and political pundits becan to write books 
prophesying a new era of Republican ascendance Vet survey data from 
that year showed tliat the Eisenhower victory was due not to masses of 
new reermis to the Republican faith but rather to solid backing from 
traditional Republicans and large rates of defection on the port of 
Citizens who told interviewers they continued to think of themselves as 
Democrats It is no secret of course that these more baste expressions 
of loyalty turned out to be far more important as long run predictors of 
voting decisions in the middle and late 1950 s than the short run Hirta- 
tions with the Republicans 

Of course, the isomorphism between Keys observations about 
properties of voting trends and this Kmd of survey information was 
immediately clear The short term oscillations in voting senes could be 
attributed to transient defections recognized as such psychoJogicahy 
by the actors involved and the center of gravity of such returns from a 
particular district would (end to represent the basic division of party 
loyalties elicited by survey measurements of "party identification 
The stability of these identifications for individuals over time was to be 
heavily documented m survey studies m ensuing years And critical ' 
or “realigning ' elections (although none had been available for close 
study since survey research techniques had developed) would be ones in 
which voters not only crossed party lines in the sense of a short term 
defection but undervveni full Jledged conversion of loyalties as well * 

Since the time when these notions came mto focus, more intensive 
work with sample survey materials has proceeded on all the fronts tJiat 
the underlying model suggested Factors govemine the development 
and intensificationof party lovaliics were explored Conditions facihtat 
ing or impeding short term defections began to be systematized Data 
collections associated w ith each election permitted a far more exact and 
direct measurement of Uve short term forces producing defections m 

varying specific election races at diflerenl levels of office* 
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eras for which no survey data are available Perhaps the simplest and 
broadest lesson from the model is the fact that what registers as change 
in aggregate voting statistics — the swing of a vote division m a district 
from one election to another — can be of two radically different kinds 
the short term defection under the allure or repulsion of immediate 
stimuli as against more fundamental long range conversion Further- 
more the faetthatmost electoral change is of the former, rather than the 
latter, kind m the United States has some critical implications for any 
interpretation of voting change in briefer periods Historians attempting 
to understand the motivations prompting the electorate or portions of 
It m an election ti have long compared /j outcomes with the state of 
affairs at /j, or the previous election and supposed that an analysis of 
these changes and their social correlates would provide the nearest 
possible approach to an accounting of the dynamic forces driving voters 
in one party direction or another at the /j election Yet, once it is clear 
as an actuanal fact that in the United States defection rather than con- 
version accounts for the vast bulk of electoral change one implication 
of the model is that something approaching half of the change in vote 
proportions from one election to the next needs to be conceptualized not 
as the product of dynamic new short term forces assoaaled with the 
election but rather as the effect of a relaxation of short-term forces that 


were new and dynamic at For example, it can be shown that Prot- 
estants in the United States moved away from the Republican Party 
and toward the Democrats more vigorously than did Catholics at the 
lime of the 1964 election (where the base line is the previous election) 
Should we interpret these aggregate patterns as telling us something of 
an unexpected set of short term forces related to religious differences 
in the Goldwater Johnson contest*^ No of course not These patterns 
have nothing to do with the election situation m 1964 per se but rather 
reflect the inevitable outcome given the relaxation of the forces pieculiar 
to the 1960 election 


The model itself suggests a proliferating array of problems to be 
attacked with aggregate statistics while lending guidance as to how 
those statistics should be treated statistically and ultimately interpreted 
The MacRae Meldrum paper involves a whole network of deasions 
about the organization of the data which such a model prompts The 
averaging of time senes to provide a stable base line, the introduction 
of an extremely minor oflice to minimize the error or noise ’ in certain 
estimates that the large scale defections associated with well known 
candidate personalities can produce, the extraction at a point of the 
traditional component of the vote, in order to punfy the vanance 
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fash, on that, although jointly oonfirmed by surtcy and 
m the current period, could be extended back to periods for which the 

survey ‘ crutch" becomes unavailable ’ 

The number of fascinating theoretical problems that can be handled 
m a much more meaningful fashion once the capacity to isolate short- 
term and traditional components from aggregate voting senes is rclinco 
seems rather long One brief example that may be of interest outside me 
American polity has to do with the progressive "dercgionahzation or 
‘ nationalization ’ of electoral responses which is as apparent to even a 
cursory inspection of American historical statistics as it is in the voting 
statistics of many other nations over roughly the same period There arc 
obvious statistical tools for mapping out these trends over time in 
precise ways, and such a mapping laid against other extant information 
on the development of mass national communication systems, coul 
provide important information on the progressi\ e integration of national 
politics in the modem period However, m just such a problem as this, it 
would be of tremendous interest to treat election returns over the past 
century resolved into their two mam components, for it is likely that the 
deregionahzation of the short term component, for self evident reasons, 
has been leading the erosion of regionalism m the traditional com- 
ponent by 50-80 years A mapping of these properties of the raw vote 
materials without the components isolated, would give an “averaged 
trend that would be thoroughly misleading 


VARIAPiCE PROPERTIES OF AGGREGATE VOTING SERIES 

The MacRae Meldrum paper focuses its attention on that portion 
of the model which involves the mean or central tendency of a senes of 
votes over time, and it seeks to identify those infrequent points at 
which this central tendency shifted significantly If the central tendency 
IS one important property of a sequence of votes, its variance is another 
We have already spoken of short term oscillations of a vote senes 
around a (temporarily) fixed mean It is clear, conceptually that these 
oscillations might have quite difTcrent amplitudes for different popula- 
tions even though both populations were responding in the same 
general direction at the same time to the same set of short term forces In 
the American case, one population might swing 2 per cent to the Re 
publican side in one election, and then 4 per cent to the Democratic side 

■’See The Concept of a Normal Vote Chapter 2 of A Campbell P E Converse 
W E Miller and D E Stokes Eleettonsand the PoUticai Order Wiley, 

1966) 
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Allhough the evidence is far more fragmentary, there is at least some 
reason to magme that a high-var.ance voting "'““'‘‘J’' T" 

susceptible, given a certain degree of crisis to approaches from ad hoc 
parties mushrooming outside the traditional parly system than vou d 
low-variance voting populations under the same objective 
other words, a low-variance population seems more firmly anchorco 
in the ongoing party competition than docs a high-sariance population 
And It may well be that the notion of such a variance property is more 
readily transferred from the simplistic terms of the American two-party 
system to other multiparty politics than arc questions of the central 
tendency of the vote and its relationship to the magic 50-50 watcr- 


An interest in the variance property implicit in the model leads quite 
naturally to a consideration of differences in expressions of strength o 
party identification in sample survey work In other words, we should 
certainly expect a population of adults indicating that they arc strongly 
identified with one or another party to show a vote record over tune 
with a lower variance in its party division than a population of adults 
who only felt vague “leanings” toward a party at the starling point 
Furthermore, there is reason to believe that there arc significant dif- 
ferences in the over»all level of allegiances between nations at any point 
m time, for the same nation at diflercnt points in time, and across 
certain predictable subpopulations of a given nation at any time One 
of the fairly reliable findings m the latter category is that the strength 
of parly allegiances grows with participation in the system, and for 
most mature systems, an elderly cohort of voters is likely to express far 
fewer equivocal allegiances to existing parties than does a cohort of 
younger voters Working from sample survey information concerning 
these dilTerential strengths of identification by age and the differential 
responses to the same short-term forces by strength of identification, we 
have simulated the behavior of a new cohort of American voters (ages 
twenty-one to twenty-four) and an elderly cohort (over seventy), 
bombarded by the same samples of short-term forces over a sequence 
of voting “trials The variance of the vote performances of the young 
IS substantially greater than that of the oldest cohort 
The obvious conceptual relationship between strength of party 
identification, as it might be observed directly in sample surveys, and 
the variance properties of aggregate voting senes opens the possibility 


’ The model bears some similanty to that discussed by McPhec in Chapter 5 of 
William N MePhee and William A Glaser, Public Opinion and Congressional 
Elections (Glencoe, III The Free Press, 1962) 
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over the period from 19l9to 1958, and his index of ipobility of the vot 

shows far higher values for France over this period than for either of the 
other two countries ■' This impression of relative instability lits wiin 
deductions drawn from some treatments of sample survey data ro 
postwar France 

None of thisnewcrwork clashes with the Siegfried observations in any 
necessary way Siegfried was dealing primarily with voting returns for 
the early years of the Third Republic, and it might easily be claimed 
that the trauma of two successive wars had served to shatter any ba^s 
of partisan stability that might have pertained in an earlier epoch On 
the other hand, it might be argued instead that Siegfried’s amazemen 
at the stability of voting lines was mainly attributable to the expectations 
he had brought to the data , had he sal down for an intensive analysis of 
voting statistics from some other countries rather than France, his 
amazement might have been even greater 
There are other senses, too, in which the Masters article does no 
attempt to confront the Siegfried thesis Masters operated on the grossest 
of all possible aggregate data vote divisions summarized at the nationa 
level only, for each of the major elections during the period of interest 
m each of the three nations The * index of mobility’ is a simple first- 
order difference between the proportion of total national votes that a 
party received m one election (e,) and in the next election of the same 
type (cj) Such absolute differences are summed across parties to produce 
a total that describes over-all mobility across the pair of elections in 
question 

Although such first differences are indeed one of the most legitimate 
and accessible ways of expressing what we could call the short term 
variability of the vote, in this comparative context they raise a number of 
methodological questions while more or less escaping the mam thrust 
of Siegfried s contentions In the first instance it is apparent that the 
operation of summing differences across parties may have a radically 
different meaning for a system where ten or more party groupings can 
contribute to the total index for an election than for a system where 


“R D Masters UnemelhcKfepourincsurcrlamobi1il6desattitudespohtiques 

Revue Frartfatse de Science Politique 10 3 (1960) p 658 

‘*P E Converse and G Dupeux PolHicizauon of the Electorate in France and 

the United Slates Public Opinion Quarterly 26 1 (1962) p 1 MacRae elsewhere 
argues from postwar French survey data that interchanges of support between at 
least the stable traditional parties have been less than the comparable movement in 
the United States See Duncan MacRae Jr Parliament Parlies and Society m 
France J946-I958 (Sew Yotk St Martins Press 1967) p 240 
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These remarks should not be taken as a critique of Masters, par- 
ticularly as It IS we who have introduced Siegfried into the discussion, 
and not Masters himself What we wish to emphasize more than any- 
thing else IS that terms like “change” and “instability" can have quite 
different referents and hence diagnostic operations for their discovery 
in aggregate statistics It is our argument that models refined in survey 
analysis can be of great help in clarifying such distinctive referents 
We have already suggested that short-term variability of the kind that 
Masters taps, in at least a gross national level sense, is a symptom o 
weakness in underlying party identifications at an individual Ie\e 
However, we should like to approach the problem from another angle, 
one that minimizes the influence of short-term variation in favor of long 
term variation and in several other senses also fits the Siegfried per- 


spective better 

Our procedure has in some ways been affected by the nature of the 
data readily available, and we might have handled the problem some- 
what differently had we been able to start from more nearly raw voting 
statistics The information on hand was contained in Gogucl s 
Geographie des Elecdons Fran^atses, which presents in map form 
selected results from major French elections from 1870 to 1951 , thereby 
sweeping back across most of the period covered b> Masters and the 
period studied by Stegfned as well Because of the presentation »n 
maps with eight or ten tinting gradations, the original metric infor- 
mation IS destroyed, and the departments of the nation are, in effect, 
collapsed into ranks with multiple tics with respect to whatever voting 
variable may be at issue in the particular map * ^ 

Moreover, there is a fair amount of “missing data” m that maps 
treating the same party of tendance are not proMded for every major 
election, although there appears to be at least one map for every election 
Finally, there is something of an “apples-and oranges” problem, since 
the variable coded on some of the maps has to do not with the 


manipulations of the data we shall use later, although it lends to require a * representa 
tn'eness of the hrst election in a senes in a way that is unnecessary (Masters op ett 
pp 668-670) 

Franfois Goguel, Geographie des elections fran^aises de 1870 a 1951 Cahicrs 
dc la Fondation Nationale des Sciences Potitiques No 27 (Pans Armand Colin, 
1951) 

This collapsing has at least one beneficial side effect for our purposes In vi^ 
of the fact that the size of the mtervals coded and the median rank were adjusted 
to leave something of a rectangular distribution, a primitive normalization has 
already been earned out vihich diminishes concern one might otherwise have about 
the differential effects of uniform class intervals on returns for large and small parlies 
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votes for the Communist Pnrt> Withm each of these five malnccs. the 
entries were pure m the sense of representing the same entity correlated 
with itsclfat another point in time UneRae- 

These matrices resemble some that were involved in ” 

Mcldrum paper, yet as we want to ask quite a dilTcrcnt set ofqu 
of them, we shall treat them in a radically dilTercnt way. For simp 
Ity's sake, we shall proceed with the major part of the discussi 
referring to the more proper United States matrix (or any one hxc it) 
and postpone mention of the additional wrinkles that the Frcncn 
matrices imposed on us We may think of this matrix as representing a 
surface that has contours into the third dimension, according to 
relative sizes of the cocfTicicnts m diflcrcnl regions of the matrix Gi'cn 
the kind of matrix v.c arc dealing with here — organized by progrw 
sivcly larger time leaps — we would not be in much doubt as tov- a 
kind of surface in general wc arc likely to find That is, wc ® 

intuitive sense that many forces act o\cr time to decay structured re 
lalionships progressively, and the longer the time that such forces 
operate, the greater the decay to be observed from any given temporal 
starting point In other words, wc would surely expect such an inter- 
correlation matrix to show the “simplex” form a pitch like a roof, "dh 
the ridge running just off the mam diagonal (the correlations between 
successive elections), sloping off progressively to a low point at the far 
corner of the matrix, where the correlations relate the most distant time 
points What is interesting to us is not the fact of the slope itself, which 
IS nearly trivial, but the possibility of investigating the properties of the 
slope to arrive at more precise expressions concerning the rate of decay 
and its general form 

One simple way of proceeding is to consider all correlations many 
given diagonal of the triangular matnx as repeated estimates of an 
underlying ‘ true” value, representing the correlation between states for 
the particular lapse in time represented by that diagonal In this light, 
it is a natural impulse to average these successive estimates in order to 
arrive at the best cross sectional “slicing” of the roof, summarizing the 


Factor analysis would be a reasonable altcmaiive at a point or two in 
following inquiry but we have refrained from using it m mcw of our interest in 
keeping some of the intermediate data organization explicit 

More properly, perhaps I should say these would be our expectations where the 
voting data are concerned In some instances one would find matrices looking much 
more like a Chinese roof than a Western one for example with the h ~ fs 
relations generally greater than those for f, - fj or i, - u and those for ti 
generally greater than others intervening tn time all of which is a simple indicator of 
an underlying cyclical process 
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However, the presence of background or higher-order partials, which 
our null model rules out. would suggest a further dimension or 
sions of stability not apparent in the single parameter, r,j And, 
course, the presence of such partials can rather radically alter the lorm 
of the decay function, progressively retarding it as they increase m 
magnitude Indeed, for any there exists a first partial r,, 2 of wna 
seems intuitively to be a remarkably modest size (1 e , the range .3 • 

for r,2’s from .50-90), which is sufficiently large to render the decay 
function a horizontal line or independent of the elapsed time 
What would some nonzero partial mean substantively in the case ol 
voting data correlated over timc'^ The aggregated nature of the data, as 
usual, introduce some further ambiguities we must subsequently ex 
amine However, let us suppose we arc correlating successive votes over 
time for individuals in a system that operates along the lines suggeste 
by our original model That is, it is a system in which various short 
term forces induce quite visible rates of defection in any given voting 
trial, although most participants have some underlying preference for 
one of the two parties Thus while an individual may readily defect in 
one election and at considerably lower odds repeat his defection in a 
second election and still more rarely a third, there will remain a very 
strong ‘ homing” tendency even when defection does occur The cor- 
relation between any pair of successive elections will be reduced by the 
relatively ‘‘noisy ’ or largely uncorrclatcd defection phenomenon, but 
with additional elections, the underlying party fidelity will be manifested 
in the fact that is greater than (rj2)* This is another way of saying 
that there exists some nonzero partial Tjj 2» sud the magnitude of that 
latter partial could be construed as some positive function ofthe strength 
and prevalence of the underlying trend tow'ard continuity or stability, 
relative to the stability already manifest m r,2 Where the partial is 
actually zero, on the other hand, it would suggest the absence of any 
of the systematic “homing” tendency that is the essential feature of the 
defection phenomenon and the consequence of underlying party 
loyalties That is, for a system of multiple parties ordered A through F , 
the absence of the partial indicates that if a voter has shifted his support 
from party C to party D over two elections, there is no unusual prob- 
ability that he will return to party C m a third election, relative to the 
probabilities of remaining at D or passing on to £ or F 
The existence of positive second-, third , and higher order partials 
in addition to the first partial would suggest that some systematic traces 
of “homing” tendencies may take two or even more elections to assert 
themselves However, we would, of course, expect magnitudes to 
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won had enough observations for brief lapses of tune and constituted 
virtually all of the data points for the longest stretches of tunc but were 
too sparse for stable estimates (less than fisc observations) in the inter- 
mediate 20- to 40-year range Hence, correlations concerning seats 
were added at this point to fill out stable estimates across the range of 

time intervals i. ♦ «»,- 

We can make a few tentative statements about the effect that me 
addition of seat proportions has on our decay function estimate of 
Figure 1 Where there are sufficient observations to compare the two data 
sources meaningfully — primarily over the first third of the slope 
correlations based on seats tend to run lower than comparable correla- 
tions for vote proportions The difTercnccs are t>pically not large 06 
to 08 — save in the first two observations Elsewhere, there is a slight 

depressing effect exercised on the function by the inclusion of correla 
tions based on seat proportions, although scarcely strong enough to 
account for the differences between the United States and French 
functions 

It ts reasonable to assume that the conelations referring to seats tend 
to run lower because the measure is cruder and more inadequate than 
those referring to vote proportions However, there is another in 
tcresting possibility In isolating the decay functions originally, we had 
faced the question as to whether, in averaging data over a 70 ->ear 
period, we might not be concealing significant temporal vanation in the 
decay rate We were not particularly concerned w ith the fact that a stray 
election here or there might yield a rather aberrant estimate, as this 
could be quite easily understood, and we were deliberately trying to 
bring into clearer focus the most general picture of the decay rate that 
could be constructed, short term fluctuations aside However, it was 
entirely possible that some major secular trend m the decay rate had 
been occurring over this period which our averages would conceal 
We had suffiaent data to examine this possibility by splitting the time 
penod into halves (pre- and post-World War I) Our a priori expecta 
tions were that if any noteworthy differences by era were to be found, 
they would not occur on the American side Rather, they would show 
up in the French data and suggest that decay rates were most rapid m 
the early period — Sicgfncd notwithstanding — with something of a 
firming up process occurnng m the later stages After all, the French 
voting system was essentially a new one in the 1870 s, and there are 
theoretical grounds to suppose that a population becomes psjcho 
logically anchored in a firmament of democratic parties only as it has 
some expencnce with it over time As of the 1870 s the electorate 



481 


SURVEY RESEARCH AND DECODING PATTERNS IN DATA 
had part, cipated ,n the American 

Arexpeeted ='>her half of the Amen— x'o«d^^ 

essentially the same fnnetmn as ha j.p 

slight differences m the avotoBe „f ,he decaj function 

fcrences do not appear to be '“"‘'o ^ j ,ha„ those for 

in the 1872-1908 period are of dilfcrences that 

theperiod 191^1960 == Hos«ver reasonable 

average less than ^ „„ bodi sides) Hence neare 

(where there are at least five obse F,»ure 1 »onld not has e been 

quite satisfied that the resuhs sugB^ted , F ^ 
palpably different had we happened by chance 

time intervalstal.cn separately rullilled on the French 

At first glance our ,„deed visibly lower than 

side for values from the earlier F' j genuine trend d would 

those derived after World War 1 If Ihdw^^ „pp„„e of he 

be of great interest since the dir^iof Unfortunately the 

slight differences fo''”'* ^;„,ns referring to proporlions of Ka 

GoEuel coding is such that mcasur j Thu 

S almost Llusively from •^'^’r/lhtrmearthoscwehasealrad, 

the decrements m values of 0 ^ j„nes between scat ba 

examined m connection "hh the ^ able to eas ight on 

mates and vote based ones F correlations from 

the matter by asking ,,„a,cs of the same period o t^h ^^ 

period look more like the scat ,o male a 

estimates of the later Pnom the early penod o° 

does seem that the vote es 1 j,Po the later s . from 

contemporaneous seat esurw^^^ '“"''“nemfore rt seems 

However ‘here are all of ' „riui,ve . „r 3 nbe 

the earher period and "’'F o^il more adequate data ca 

wisest to leave the point a moot » 

mustered dressed is the fact that no" ^niencan 

Perhaps what j, if,., once between petmds n ^^r 

ntrons - the shght absolute oosed es"WO'« 

side or the larger difference be _ ^ '"““F ' „maicd decay 

vote based ones m Tcfferenees m enab »" 

conclusions suggested y jHling F“'' ' „id remain '«> 

functions would be minim baiches of data 

bo,hs,dess,neel916 a"d>rt'h''» 

clearly discrimimted from one ^^u^lr^ionuarijwe”"" 

rr For reasons to be menuoneabj^ ^^p-guret 

ated rtom the anabsw and from «■ 



482 


PHILIP E CONVERSE 


Indeed there are sufficient initial observations in the pure voting 
.Ltats ^oni France to give good ^.-tes of the b^inmng f • 
decay function (9 and 6 observations, 

mean values) and these first values would be 766 and 606,J^ot 
are the two points on which the seat-based estimates 
greatest depressing effect in Figure 1 If we took these as 
rather than those of Figure 1. we would eonclude ‘h= F™d, 
function started at a slightly higher point than the American ( 77 raffier 
than 70) but immediately declined below it at a precipitous ra e 
would then suggest slightly less “noise’ between adjacent pairs ol 
elections in France, perhaps a result of the responsiveness ol the 
American electorate to a sequence of rather disparate short-term fo 
from election to election But acceptance of such a possibility carries 
with It a corollary, which is that the background partials in the French 
case would be considerably weaker than Figure I suggests we have 
entered in column “France S’ of Table I the values that 
appropriate for the beginning of the decay function as indicated by the 
pure-voie materials In this case, the partials are trivial from the very 
beginning 

While the French data are too fragmentary for any more intensive 
analysis, the American materials readily sustain further dissection As 
v.e have seen, a “split half’ test indicates that our decay estimates there 
do not depend heavily on the era for which the time senes is drawn, m 
the period since 1870 However, the averaging of the initial matrix 
conceals contributions made by specific elections to the summary decay 
curve We can now imagine a smoothed matrix, in which the entriw 
alongeach diagonal are identical, and the average value for the diagonal 
Such a matrix represents a set of “expected values,” in an actuana 
sense for the American decay gradient And the matnxlhatis the di • 
ference between this expected matrix and the observed matrix gives us a 
residual matrix highlighting the differential contributions of the various 
paired election correlations to the summary decay function A negative 
entry means that the pair of elections involved taken alone would 
suggest a more rapid rate of decay of partisan structure than afiirme 
by the rest of the matrix A positive value indicates abnormally slow 
decay or a heightened estimate of stability 

Once constructed, there arc several alternate messages that the 
residual matrix might convey The positive and negative values might 
show no particular pattern at all over the surface, indicating that while 
there is inevitable variability m the decay estimates, it is too unsystem- 
atic across time to be of interest Or, again, there might be some 
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Wh.k no elections on the positive end of the 
sharnls as do those mentioned for the negatise extrem^ the three 
elections of the relatively quiescent ISSO's all ^ 

covered by postwar survey research, the largest positive value is shown 
for 1948, noted as a low-turnout, party-loyalty election 
Working from survey research materials, Campbell has associa 
surges in voting turnout with the entrance into the active electorate 
individuals whose attachment to the standard parties is much weake 
than that of more dutiful or experienced voters Conversely, m a 
American election of relatively low turnout, voting tends to be ten to 
the strongest party loyalists “ While realigning elections are typical y 
“surge” elections in this sense, noteworthy surges arc not necessarily 


accompanied by realignment 

If we classify the Presidential elections m our set according to their 
“surge” or “decline” properties by expressing the ratio of the turnou 
m each election to that of the average turnout among the eligible 
electorate in the next two preceding Presidential elections, wc find a 
product-moment correlation of — 55 between the two arrays The rnore 
marked the surge, the greater the negative contributions to the residua 
matrix, the more notable the decline, the greater the positive con- 


tributions to the matrix 

This is a very satisfying result, for it lends considerable weight to our 
general interpretation of the initial matrix and its implied deca> gra- 
dients That IS, had we limited our averaging to those elections showing 
surge properties, we would have estimated the rate of decay to be more 
rapid than that depicted in Figure 1 and would have inferred weaker 
levels of partisan identification in the population If, on the other hand, 
we had drawn our averages from those elections involving a relatively 
shrunken, hard core electorate, we would have found slower decay rates 
and inferred a greater degree of partisan stability Yet these differences 
in results would not have signaled any unreliability in our procedures, 
for we would, in fact, be looking at two different cuttings of the 
electorate Sample survey evidence based on individual data indicates 
quite strongly that these two cuttings of the electorate would differ m 


their average level of partisan attachment in exactly this way 

Thus we come full circle, and wc see that m averaging vve have avoided 
dealing with either an abnormally swollen or a shrunken electorate and 
instead are representing m Figure 1 electorates of average size for 


** Angus Campbell ‘ Surge and Decline A Study of Electoral Change,” Public 
Opinion Quarterly 25 (Fall 1960) pp 397-418 
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the one we shall drscuss here ■ It .s our task to discover these diverse 

models and to fit them together 

Even when a widely applicable method such as factor analys . 

It may have to be adapted m specific ways to the problem at hanu Th 
we shall speak, not of factor analysis in general but of a particula 
variant of that method designed to deal with voting m a Wo-P/irty 
system where the parties have continuous existence over time, nave 
stable groups of supporters, and undergo major realignments in their 
issue conllicts only once in about a generation These changes of issues 
are important politically, since they reflect the increased access of new 
groups to power, they redefine the real constituencies of parties and o 
their elective officials, and they define “gateways"^ that select and 
transform the issues of partisan debate 


THE "AL SMITH REVOWnON" AND THE NEW DEAL 
THE REORIENTATION OF 1928-1936 

One instance of a lasting change of issues — the most recent one 
that has occurred m the United States — is that associated with the 
Presidential election of 1928 We shall use it to illustrate the method m 
concrete terms and then restate the underlying assumptions more 
systematically 

There are relatively few elections m the United States m which lasting 
reorientations of the voting public have occurred Most Presidential 
elections have involved relatively uniform swings of states or counties 
toward one party or the other Louis Bean summarized this phenomenon 
in his chapter title, "As Your State Goes, So Goes the Nation But 
the occasions when this uniform swing does not occur arc of special 
interest because if the reorientations persist, they can mark the injection 
of new issues into national and state politics for a generation Lubell 
noted the importance of the "A1 Smith revolution ' which preceded the 
"Roosevelt revolution",* and Key, naming these phenomena critical 
elections," went on to show that Bryan s candidacy in 1896 marked an 

‘ See M J Kessciman The Ambiguous Conseitsia A Study of Local Government 
In France (New York Knopf 1967) pp 27-34 
* This term was introduced in B R Bcrelson P F Lazarsfcld andW N McPhec 
tomw(Chicago University of Chicago Press 1954) p 209 

Bean IIom to Predict Elections itiew YotV. Knopf 1948) Chapter 10 The 
follovsing accoums of the reoncniaiions of 1928 and 1896 arc adapted from D 
MacRac Jr and J A Mcldrum Crilica! Etcclions in Illinois 1888 1958 
American Fohiical Science Reviett 54(Scpt«nbcr I960) pp 669 683 
‘S Lubell The Future of American Politics ifievi Yorii Harper, 1952) p 35 
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expect the two factors mentioned previously — for elections before and 
after the cr.tical one -to be highly correlated For this * 

reorientation involved in a critical election can be seen more 
constant or “traditional” component of the vote is rernoved This can 
be done by setting aside not only the average vote (such as percent g 
Democratic) m each election, as would ordinarily be done in calculating 
correlation coeflicients, but also the average vote for each geographical 
subunit Thus m Illinois, after subtracting the election average, ^ 
further correct the percentage Democratic for each county y su 
tracting that county’s average residual percentage Democratic rom 
all residual percentages for that county The result is that the irs 
principal component of what remains is a single dimension distinguis 
ing (in positive and negative terms) between the opposing tendencies 
related to the critical election If a critical election actually occurre in 
a given period, this would be revealed by the temporal pattern an 
magnitude of the loadings on this principal component, there would e 
a senes of positive loadings followed in time by negative ones, or vice 
versa 

If this procedure is earned out, the results can be arranged in a 
rectangular matrix where each row corresponds to a county and each 
column to an election The clement in the ith row and yth column wil 
then be 

where 


£),/ = the initial percentage Democratic for county/ in election^ 

Di = the mean percentage Democratic for county / over all the 
elections being considered 

J5 ; = the mean percentage Democratic for all counties at a given 
election 

/5 =: the over-all mean percentage Democratic (for all counties 
at all elections considered) 

Let this data matrix be referred to as X and its transpose as X 
Wc then compute by “direct factor analysis” the first principal com- 
ponent of XX' and at the same time of X X^^ Loadings on these 

” Tins procedure can be carried out by means of a FORTRAN program DFAI. 
for the IBM 7094. the program and opcraimg inslructtons arc available al mailing 
cost from the Social Science Program Library Computation Ccnicr University of 
Chicago References on the maihcmaiics of the mcihod arc M A Woodbury R C 
Clclbnd and R J Ihckcy Applications of a Factor Analytic Method m ihc 
Prediction of Biological Data Bfhmioral Science S(Aprill963) pp 347 354 . and 
P Woi^x.rattor Amilisisaf Data Stamces{Uc^yo\V. Holt Rinehart &Winston, 
1966) 
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a^lyfiS alone Ihus tells us that a lastiej reonenlalion of the 
‘ ’> cannot tell just hou the issues changed Changes 

n„A”V!r“ ^ mfered from study of the campaigns themselves 
Of the socioeconomic and political characteristics of the counties 
!u\a^o’^^ direction or the other Jt has been shown that 

the !92^}936 reorientation in Illmois fa\ored the Democrats most m 
urban areas that also favored the repeal of the Eighteenth (prohibition) 
Amendment and that those areas which mosed rclatisciv toward the 
Republicans were more rural and most opposed to relaxing the law 
regarding alcoholic be> erages Smith’s Cat holicism. though Lndoubtedl> 
an important factor m 1928 had less effect m the longer run, the later 
consolidation of the Democratic Party s support, which Lubell called 
the “Roosevelt revolution.'* seemed more related to class and ethnicity 
than to religion We therefore call a positne loading on this factor **New 
Deahsm ” And two splits in the Republican Party — m J9J2 and 1924 
— seem to have reflected dissidence m the same areas whose lote later 
tended to go over to the Democrats 
We can also see from Rgure 2 that the votes for different offices 
behaved m different ways especially during the period just before and 
after 1928 The candidacy of AI Smith m 192vS went far toward re- 
orienting the Presidential vote and some of the contests for Governor 
and Senator even ran ahead of the Prcsideitml vote On the otherhand 
the vote for Trustees, and especially that for Congress lagged behind 
the Presidential vote in 1928 and 1920 This lag suggests that the capacity 
of a Congressman to adapt the meaning of his partisan affiliation to the 
political situation may stabilize the party vote for Congress as it seems 
to for local offices as well On the other hand, the Trustee vote may 

reflect a limited association ofvtraight ticlei voting wilhloca! traditional 
party identification, even though the straight ticket is also affected by 
the personality at the head of the ticket Because these changes took 
place over several elections, we speak of a “cntical period ' rather than 
Simply a critical election 

The eencrahty of this rconcnution may be shown by data for 
Michigan and Neiv Hampshire. o.er « «milar penod. ai m Figure 3 

• ♦C*- V r» lef^ Jr. “Patiisanshm aad County OlTtx The Case rf Oh«.\“ 
a70unet9S)} r S2« M Vtoos ru.ufrm 

^2!iTSS^'oS(lUWniorr Johm Hopt.w Press, IWl rf 
» C ‘ T rTmi^Jl and W C Miller The Mtwoaiwial Ritis of Sira^fM a-sd 

SphiSrtSnf 2/iJ«iici9ST. rr Je 
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In ih,s case only thirleen Presidential elections are anal) zed but the 
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3 The areas m question must be of the proper stze and ™ 

reflect the reonentat.on tn question If, for example, tte 
IS based on social classes, then the areas chosen must be small enough 
to differ from one another according to the residential patterns ofsoaal 
classes If it is based on urban rural distinctions, the areas must diner 
in this respect If the reorientation realigns broad regions relative to o^ 
another, then the total sample of areas analj'zed must extend sut>- 
stantially across these different regions One shortcoming of our analysis 
of Illinois counties is that Cook County, which contains Chiwgo and 
includes a large proportion of the state’s population, is counted as onlj 
a single unit Differences m voting behavior and realignments within 
Cook County are thus not adequately revealed by this analjsis 
4 It IS assumed that the social and economic composition of the 
areas in question does not change radically over the period in question 
Our procedure has been to compute values of the u’s, or area factor 
loadings, and to compare them with census charactenstics of the cor- 
responding areas But there is no provision in the anal) sis for change m 
these census charactenstics U is true that our method would reveal 
electoral reonentations among aggregates df persons that suddenly ex- 
changed certain groups of members at a given time, but the normal 
processes of settlement, urbanization, suburbanization, and in- 
dustnalization seem different m kind from those we are seeking It is 
possible, however, that the steadily rising curve for New Hampshire m 
Figure 3 reflects an increasing social difference among townships 
5 The period of lime chosen for analysis must include one and only 
one lasting reorientation of the electorate and a substantial number of 
elections both before and after that reonenlation For if the positive 
and negative factor loadings are not sufiicienlly represented in the data, 
the reorientation cannot be expected to emerge as a pnnctpal com- 
ponent In turn, this pnncipal component must be sufliciently pre 
dominant to preclude the problem of rotation to preferred co-ordinate 
axes If there is more than one reorientation {as we shall see later for the 
Civil War period), or if the major reorientation is so weak as not to 
dominate other changes m the vote then we can have correspondingly 
less confidence in our interpretation of the first pnncipal component 
unless wc can support it with other mformalion Thus the need to 
choose the proper period prevents the method from being purely 
‘ blind.” unless the rotation problem can be solved 

6 If all the preceding conditions are satisfied it is still necessary that 
for each geographic subarea studied the vote at a given election can 
rcasonabl) be represented as the sum of several components 
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of the Status of slavery m the tcrriloncs The subseq^t controvers) 
affected every political organization in the nation The Rcpubli 
Party vsas formed in 1854, in the same period the Whig Party, thel^ormcr 
major opponent of the Democrats, disintegrated. By 1856 ihc Kc- 
publicans were in a position to wage an aggressive campaign, and the 
situation was favorable for a significant shift m political afiiliations 
The Democratic Parly, while it persisted relatively unchanged in 
Illinois, divided nationall) into Northern and Southern wings In 
addition, two substantial minor parlies, the American or Know- 
Nothing Party and the Free-Soil Party, rose to temporary success Our 
analysis of the period is therefore complicated by the problems of a 
multiparty system 

Preliminary investigation (see Figure 1) leads us to expect a reorienta- 
tion in 1856, and we therefore extend our senes as far back as we can 
before this time It is not practical to go earlier than 1848, for before 
that time the rapid early sMtIement of Illinois was causing repeated 
changes in county boundaries, and the character of the settlcmciU 
was less stable with regard to its presumed political traditions A 
closer look at Figure I reveals a second peak for Illinois at 1864, and 
we shall indeed find that this year corresponded to a distinct reorien- 
tation 

For this period we use the Presidential vote together with the vote for 
State Treasurer in non Presidential years, omitting years for which the 
data are inadequate Factor analysis gave rise to two principal com- 
ponents, as shown in Figure 5 The first component shows a clear 
transition from positive to negative, but this occurs m 1864 rather than 
m the year expected from national voting, 1856 We thus consider the 
second component as well, and this comes nearer to our expectations, 
with a change of sign in 1856, the signs thereafter remaining negative 
until 1872 The curve for this component, however, shows an upward 
trend from 1858 on 

We thus confront several new problems in explaining these reorienta- 
tions We must account for the fact that the first principal component 
did not correspond to the major national reorientation of 1856 as 
expected At the same time, we must try to choose a set of preferred 
axes among those that might be obtained by rotating the first two 


” We simply use percentage Democratic in our analyses grouping all other part*** 
logelher We omit the off year elections of 1846 1850, 1854 and 1874, in each of 
which one of the major parties was missing from at least ten counties in the voting 
statistics 

This IS a general problem of factor analysis, sec Harman, op eii Chapter 12. 
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One method of choosing preferred axes is through inspection of the 
plot of the county loadings (ids) If we plot the county loadings in a 
two dimensional space corresponding to these two principal 
we find that the distribution of loadings on the first differs from that o 
the second On the first principal component, the ten highest loadings 
ate all positive all correspond to a contiguous group of counties in tne 
southern tip of the state The loadings on the second principal com- 
ponent on the other hand ate symmetrically distributed about zero 
This contrast could not be accentuated by rotating the two components, 
so we interpret them without further rotation 

Closer investigation reveals that there was indeed an electoral 
reorientation in 1864, but it was restricted to southern Illinois, it was 
lasting, but not even state wide in scope A small number of counties 
swung to the Republican side in 1864 and remained there These 
counties were contiguous and located largely in a single Congressiona 

district Before 1860 they had strong Southern ties and sympathies an 

had consistently given overwhelming support to the Democratic Party 
After the outbreak of hostilities, however, they rallied to the support 
of the Lincoln administration and contributed high proportions of 


volunteers for the Union army 

Apparently instrumental in this shift was John A Logan, who 
represented this district in Congress from 1859 to 1861 His political 
position at the start of the war was equivocal, but in the summer of 
1861 he resigned his seat in Congress and accepted a commission as a 
general in the Union army His former law partner and successor in 
Congress, William J Allen, left no doubt concerning his Confederate 
sympathies, his opposition to the Lincoln administration was so 
strong that he was arrested and imprisoned during the war In 1864 
Allen was a candidate for re election on the platform of the Peace 
Democrats Several months before the election Logan was granted a 
leave to permit him to campaign for the Union ticket and for War 
Democratic candidates The issue of loyalty to the Union was probably 
more clear cut and dramatic in this contest than in any other in the 
nation Logan s extensive and aggressive campaign was undoubtedly 
a strong factor in the reorientation of the vote in this area 


A Vanmax rotation changed theaxesby only 13dcgrees 
** For documentation on these events in addition to histones of the period sw 
llllnoij Stale Journal iSpnn^Md) July 7 October 26 November 15 1864 J G 
Randall and R N Current Uncoln the President Ear/ /■«// A/eorure (New York 
Dodd Mead 1955) p 240 ini Dictionary of American Biography Vol I PP 213- 
214 (Allen) 
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This analysis should not only throw light on Illinois politics but also 
suggest possible extensions and modifications of the method used Some 
ways in which the method might be made more useful are as follows 

a Studies of other countries, other states, and other geographical 
subdivisions can help to confirm or correct some of the hypotheses 
formulated about Illinois counties 
b Extension of the analysis to further principal components beyond 
the first, together with careful study of how these can be inter- 
preted, may reveal other significant reorientations of the vote 
c Transformation of the variable “percentage Democratic” might 
be earned out to correct for the decreased variability of the per- 
centage Democratic (hat necessarily results when it approaches 
zero or 100 per cent 

d Regression analysis can be used to test directly hypotheses about 
population groups and the vote, which can be only explored and 
formulated with pnncipal component analysis 
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from the beginning of parochial registration in 1 538 until the beginning 

ofcivil registration in 1837 a.nils 

b. Family reconstitution, in order to complete demographic details 

for selected parishes o\er the same period 

c. Discosery and analysis of census-type documents earlier than 
1841, when the English Census began to gise adequate social structura 


inlormauon , 

d. Collection of ecological parameters for cver>' parish lor v^nicn 
demographic and social structural data arc being recovered 

e. Development of computer programs to carry out family recon- 
stitution more speedily and with much greater vcrsalilily. This is pro- 
gressing toward a general scheme of record linkage using a wide range 
of data other than the parish registers (poor law records, wills, etc.) 

/. The preliminar>’ stages of a nationwide study of the extent o 
literacy. The evidence will be the signatures in the parish registers back 
to 1754, when this practice started in England, and the records ofe^W' 
astical courts earlier than that period back into medieval times. These 
two sources will be supplement^ by any other source that can be shown 
to be capable of >ie1ding statistically reliable results 

g. Correspondence with several hundred volunteer workers all over 
the country who are consulting local documents on behalf of the Group 
for the purposes specified, especially the counting of entries in parish 
registers, together with signatures and crosses. 

h. Editing and publishing a scries of works containing the results of 
the activities listed so far and presenting descriptions of historical 
demographic and social structural techniques.* 


All these activities^ are likely to show up historical regional variations of 
a quantitative ecological character. Progress made so far with the first 
four of them will be successively described. 


bet5^een nations and cultural areas A number of results have been published It has 
not been possible to emend the present article to take account of these developments 
except in a veT> few ;articulan 

* Techniques relevant to activities o. b. c. and rfare described m the first volume o* 
an ongoing senes entitled An Introduction to English Htstoneal Demography. 
edited by E. A. Wngley. It contains a contnlmtion by him on family reconstitution, 
one by D E Eversley on aggregative analysis, another by Peter Laslett on the 
anab^is of census-type matenals before 1841, and one by A Armstrong on census 
materials from 1841 on, published in February 1966 A volume descnbing technique 
for the nineteenth century is m preparation For literacy, see R. S Schofield m J 
Goody, ed . Ltleraey m Traditional Societies {!C^mhna%,c, 1968) 

* It would be wrong to imply that this is (be only work relevant to this Symposium 
going on in Great Britain. Scholars are «igaged in the history of population and 
social structure at the universities of Leicester (where there is a professorship 
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Within these important limitations it is possible to make the following 
claim for this material for the purposes of historical regional comparison 
in England 

Every regional variation llial consists in or can be correlated Kith the 
manllil} numbers of baptisms marriages and burials registered by the 
priests of the Church of England betneen the early seventeenth land 
perhaps back to the mid sixteenth) to the early ntneteenth century can in 
principle be examined tn numerical terms 

The importance of this claim has to be judged m relation to the fact 
that this was the period of industrialization m England, the earliest 
country to be industrialized 

The work of calculating such regional variations, even the work of 
decidingwhether the population was growing ordechning over-all and at 
what period, has not yet been begun by the Cambridge Group The 
sample of 5 per cent or more of the Parish Register entries is in our 
hands, but at the moment we are engaged in tackling the formidable 
statistical problems presented by the material, which is all in machine- 
readable form 

The variables that we intend to examine in relation to demographic 
fluctuation in English settlements include area, height, distance front 
London, whether or not a market is present, whether or not the parish 
IS maritime, status of parish (county town, parliamentary town, etc), 
date of enclosure, farming (subdivided by pastoral, arable, mixed 
farming, market gardening, etc), occupational structure (purely 
agricultural, agricultural with woolen industry, mixed industrial), type 
of industry other than textile, distribution of land, whether known to 
contain Roman Catholics or Dissenters, whether known to contain a 
seat or seats of nobility or gentry This list may be extended even further, 
though certain items may not prove to be practicable We are also 
considering some indication of the level of prices for each year, if 
possible a regional one, and perhaps indications of climate, war and 
peace, and so on 

In the present situation, therefore, only impressionistic answers can 
be made to questions about the contents of these data I shall mention 
here only our first impressions of probable answers to simple historical 
questions, for we have concerned ourselves with these more than with 
historical regional cross compansons It seems likely that English 
population did increase over most of the country from the earliest time 
when V.C can get figures until the mid seventeenth century After that, 
there followed a very interesting period when population was static or 
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their relation to indices of increase or decrease in population totals, 
fertility, nubility, mortality 

b Impact of contagious diseases as distinct from and as correlated 
with a 

c Effect of size of community, especially rapid increase recognizable 
as urbanization, on such demographic indices 
d Effectiveness of controls (ecclesiastical and general social) oscr 
sexual and other behavior revealed by such indices as seasonality 
of conception, seasonality of marriage, illegitimacy, and prenuptial 
pregnancy 

This list could, and no doubt will, be considerably extended There are 
two further points to make here about aggregative analysis* 

a It provides essential extension, confirmation, and material for 
critical appraisal of the results of family reconstitution (sec next 
section) 

b In general, along with family reconstitution, it makes possible the 
use of all the techniques of the soaal science of demography to deter- 
mine regional vanables in England as far back as the mid-sixteenth 
century 


FAMILY RECO!>/STITUT/OS 

The whole process of family reconstitution has been carried out 
successfully for one English parish, that of Colylon in Devonshire, by 
Dr E A Wrigley, cofounder of the Cambridge Group We have 
available therefore at Cambridge exhaustive demographic detail, 
accurately reckoned and equal to or even exceeding the standards of 
contemporary demographic information, for something like a fifth of 
the families living in this one village from 1538-1837 These details 
include age specific fertility rates mortality rales, age at mamage, 
expectation of life at various ages, size of completed families, age at 
birth of last child m completed families, and so on, all broken down 
into subperiods of some fifty years Reports of the results of this study 
are now m progress from Dr Wngley himself, and copies of his 
first full length article* will be available at the conference This article 


^ /anniy Limitation m Pre Industrial England ** Economic 

S<^dSen« 19 UApnl 1966) pp 85-109 The methods used are 
•nterestcd may be referred to Wngley s ov.-n 

19 fis M International Economic History Conference m Munich 

1S65. entiUed Some Problems of Family Reconstitution Using English Fzmh 
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Jimjlatjon during {he period already referred lo m the precedine 
section as peculiarly importam for Englisftdih.fopment (and .LustnaN 
umg social development generally), that is. the period from the mid- 
eighteenth centuries This is the earliest demon- 
strated example of family limitation affecting a whole premdustriaJ 
community, as distinct from a social group such as the nobditv of a 
country. ^ 

The elaborate worl of data preparation, done manually by volunteers, 
is complete for another Devonshire parish, Hartland, but though the 
fmpwr/Ofl gnen is that the figures may confirm those for Colyton, no 
result IS yet available Somewhat the same situation obtains for a very 
different and distant parish. Easingwold m the Vale of York Recog- 
niRing (he fundamental importance of checking the Colyton results 
against parishes of different economics as well as recognizing the 
absolute value of all such studies, the Cambridge Group is now 
undertaking, supervising, or encouraging similar projects — none as 
yet advanced as far as that for Hartland — at Banbury m Oxfordshire 
(a midland, hand-industry, and market center), St Leonard's Shore 
ditch, London (for urban-rural comparison, the difliculties are formid- 
able), and a group of parishes in Bedfordshire (very important so 
as to extend the proportion of analyzable families and to allow for 
migration) * 

All the possible local, regional, and historiral variables that werelisted 
as suitable for study by aggregative analysis of parish registers can, of 
course, be examined for the small sample of parishes that will be re- 
constituted But in these cases we shall be able to observe the vanations 
in the actual behavior of individuals, which m the case of aggregative 
analysis is only very generally and vaguely indicated fay changes m total 
per month or year It can be recorded, as one more impression, that 


RegesferAfarmar Or Wngfermsietptim his debt, and the dtbt or the Cambridge 
Group as a wbote lo iJic French historical deinosrapher? for ilie meihod he has been 
using and espcciafly (o Louis Henry and ha colleagues at the Insiiiut National 
d Eludes Dimographiques m Pans (The resuhsof the ^onsiitutw. at Colyton for 
fertility and life expectancy were pubhsfwd byE A Wngicy m J)Ms. Spring 

* Since the preceding was wniten, ihc fwwpect of rap'd and efficient reconstituiion 
of a tong list of addnc^al English parBhes fay the use of 
transformed the siiuai.on ti remains true hower, that f 

reeisiers must r^trici oor aciivilrfs a greal deal more than we should liite And we 
shall «ve7 be inlTiosmon of our colleagoe, .a France wtih respect lo lh« crucial 
undertaking. 
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vihcrc the aegrcgalive figures for any community follow the seo' 
genera! historical pattern of population change m England. »hi^ «« 
described earlier, the detailed demographic mechanism could be lii.e 
that obsersed at Colyton. which has a demographic histoo’ betjsccn 
1539 and 1837 of the "normal" type. It must be repealed that only by 
the combination of aggregative analysis and reconstitution can demog- 
raphy be put at the disposal of the historical study of regional variation. 

Each parish will have its individual limitations and drassbachs. and 
none will yield certain classes of figures which social scientists wou d 
like to have. These include suicide rates, abortion rates, rales of sttll- 
births, and so on. But migration figures, as obtsiincd by studying place 
of origin of marriage partners, will be asailabic as a matter of course. 
Where the registers yield occupations (as they do at Banbury but, 
unfortunately, not consistently at Colyton or at Hartland), social mobil* 
iiy by marriage can be studied, as well as difTcrcntial fcrti!il>e 
mortality, and migration by occupation. 


DISCOVERY AND ANALYSIS OF CENSUS-TYPE 

DOCUMENTS FOR SOCIAL STRUCTURAL PURPOSES 

It has already been pointed out that the comparative analysis of social 
structure from complete and ofTicial census material can be done only 
from 1841 onward.’ The whole previous period, and this goes back 
further than the parish register era (indeed, it has no obvious historical 
limitation), can be examined only in documents of a similar kind made 
for a variety of purposes that were only seldom sociological. When work 
of this kind was first contemplated in the late 1950*s and early 1960's, 
the general impression seemed to be that such documents were rare — 
so rare that they would yield very few statistically viable results and 
certainly not much in the way of regional, historical comparison. 

This may, perhaps, still turn out to be the case. But the numbers and 

’ In fact, only the 1841 and 1851 census material seems to make possible the full 
analysis of the structure of families, households, and communities which wc stnsc to 
obtain for earlier documents, and after 1861 ihe British census cannofbe inspected, 
because of the extraordinary hundred-year rule enforced by our Registrar General. 
Even the 1841 documents have formidable difncuUies.Il seems possible, then, that 
full historical examination of social structure, at least until late in this century, may 
be confined to the material mentioned in this article and footnote At least, we shall 
soon be permitted to photograph census returns later than 1861. but with names of 
persons blocked out; thus all kinship analysis will be made impossible It is unfor- 
tunate that the original material of recent postwar censuses in the country has 
appa^tly been destroyed In En^and, therefore, the long-term possibilities for 
social structural history do not at the moment look good. 
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Proportion of households with resident children of mature age 

(over sixteen) 

Proportion of each age group married and single 
It can therefore be stated that 

£icO xanabic in the preceding list, and exer} other xariable that could 
be correlated ii ith an) one or an) mnnber of them » Inch ts of importance 
for historical-regional analysis can in principle, be reeoxered for this 
gro\ung sample of English communines before 1841 

Once more, it must be said that the work of anal>sis has >ct to proceed 
to the stage v^here numencal cross comparisons can be published svilh 
confidence But we have alrcad> committed ourselves in public to the 
following general assertions about this body of evidence The mean size 
of household in England in preindustnaJ times, and at least as far back 
as the seventeenth century, lies between 4 5 and 4 8, showing no marked 
tendency to var> with time In any one communit> and in the popula- 
tion at large the size of household varied direcll> with social status, the 
higher the status, the larger the household This variation is due almost 
entirely to the distribution of servants between households, and sen ants 
were something like 10 per cent or more of the total population For the 
rest, households consisted of the nuclear Tamil) ‘ * m all but 10 per cent 
of the population sur\e>ed,andthecxtcndcd ormuliigcnerationa! family 
is rare in preindustrial England The proportion and distribution of 
servants, the size of the sibling group, and the numbers of mature 
children resident m the parental household vary with occupational 
structure and will probabl) be found to vary also with degree of 
industrialization 

To these statements can now be added two further possibilities The 
first IS that in England the tendency may have been for the “extended 
family group” to become more rather than less common as industrial- 
ization and urbanization proceeded Resident widowed grandparents, 
visiting and lodging relatives, shanng of households between parents 
and married children certainly seem to have become commoner after 

The points made m ihc text are examined in detail in the forthcoming publics 
tion referred to in the precxding footnote In the sample of one hundred parishes 
multigeneralional households make up on ihe aserage only 5 7 per cent of all house 
holds and 11 7 per cent is the mean proportion of households uith resident kin of 
any kind 

“That is the conjugal family with or without servants one third of all households 
had servants The evidence confinns and gives histoncal extension to William S 
Goode s conclusions in iVorld Rnobitum and Family Patterns TheFrec 

Press, 1963) 
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■he early nmcleenlh cemuo. The second .s that the nuclear family in 
England since medioaJ iim«, at least, may be correlated wnh what 
Hajnal has called Die • European Marnagc Pattern ’ (higher mean age 
ol marriage, higher proportionofihoscnc\crmarricd)’^and so may be 
contrasted wiih a fendenej toward (he extended family m Eastern and 
perhaps Southern Europe, Russia, India and other parts of the world 
Though this material offers multiple opportunities for regional com 
pansofis Within England and Britain, we arc once more interested in 
Continental and world comparisons, as wdl as in Jong term historical 
comparison Morcoscr. the demographic evidence already described 
ties in With the social structural evidence under discussion It must be 
remembered tliat for c\ idence of the latter type, and to some degree the 
former type too wc have m England the great advantage of the work 
done m preindustrial times by contemporary political arithmeticians, 
especially that of Gregory King 


RC,\fAlNDCR OF WORK OF THE CAMBRIDGE GROUP 
AND CONCLUSION 

To (real at similar length e\cn m (he very superficial way that has 
been adopted here, the rest of the activities of the Cambridge Group 
Nvhicli bear or will bear on historical interregional comparisons would 
require a great deal of work It is obvious how the development of 
computer programs will mcrease the speed at which these results can be 
obtained The cross comparisons of the material from the parish regi* 
sters and the census-type documents that will iben be possible will 
transform (he subject A body of theory and practice will also, we 
hope, come into being and eventually become standardized 
A word should be added about the mechanism of data gathering and 
prchminaryproccssmg m the field by volunteers There are 260 men and 
women, living all over England, with whom we bad (by 1966) made 
contact by circular letters by orgamzational channels, and wme 
extent by (he use of broadcasting A tnbafe is perhaps due to the T^ird 
Programme of the BBC. which, with Vie Listener for the publication 
m print of its addresses, is an organ probably unique m any country 


Itajnal ' European tbnerm m 

(Chicago AWme. 1965) 

For an inirodocijon fv.iih ,.^,5 -nt ffortJ IVe Nave Ian (Nev/ 

of these remarkabte researches, see Peter Laaei 
York Scribner, J966) 
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But the 165 local historians among our correspondents must have their 
equivalents in other European (or American or Asian) countries, and it 
IS significant that their introduction to us has come through local history 
societies, archaeological or genealogical societies, archivists, or clergy- 
men University teachers have introduced 50 to us, and tutors in 
extramural adult education 20 more, only 25, perhaps an unexpectedly 
small proportion, are schoolteachers 
This body of volunteer correspondents is, as the reader must surely 
have recognized, a research tool of extraordinary value in this field, and 
we have been astonished by the speed with which we were able to bring 
It into being and with its flexibility when created This is the only 
important contribution to methodology that, m these early stages of 
the work of the group, the present paper can claim We are now begin- 
ning to use this research organ to study the issue of the history of the 
growth of literacy m England over the crucial generations of in- 
dustrialization When we also begin to analyze geographical mobility 
and occupational structure, all from parish registers and other locally 
available documents, wc hope to extend greatly our available infor- 
mation in numerical and other forms for the purpose of regional 
historical comparison 



PART SIX 


The Oriianization of 
Ecological Data Archives 

O 

the tintilj /« presented in this t<;/uin^ are basiJ an collcctmr of data erganued 
hj some sort loeaUt^. Some ^ efi«e eolUetiont ate simply ^ri*ate data jiks hudt 
up Jot some spieijie task of onolrsis, in tetsam easts the anginal fie nill remain 
ns personal and inaeeessihle as the notes taken m the preparation of the resisreh 
report, In others the file in// sooner or later he incorporated mt/i otAcp flet in a 
broader erehlse and placed at the disposal of ^ooUfvd scholars for reanohsis and 
reinterpreiarion The Inocnthe to "go puhlie" is mcrkedly stronger in the ease 
of eealogieal data than in lAe ease of eo* data from such costly reteaixh operatiane 
os te«j or surie)s After all, most of ffic data lo the majonijr of the eti5/in^ eeo- 
logicol files are simplj' lifted out ef oIreoJj published i^ciol stotiaics, and there Is 
no nay of prerenting othet scholars from duplicating ntch eforit ef data hnloge 
Obviously, this does not mean that scholars mil he willing to tpread thetr data 
inJiscrIminaielj lo all comers, there ore bound lo be presnins lo protect the original 
Imestmenes and to secure priority rights m the preparation of pubheotums But 
the trend Is ne< ertheUss guiie clear with (he sprex^ of high letcl computer facilities 
to more and more research centers there w frounJ to be a mushrooming of efforts to 
assemble the long baeklogs ofeeulagleal statistics on cards or tapes and to establish 
scrrice oriented archives of one sarsc^ or another 

The fve ehaptas included in Pan Sic of ehst volume all deal m their various 
nays n,th the deichpment of streh archives Warren Millet, already the rider 
statesman of the archival maremem In the United States, renciis the esperumret^ 

the Inter-Uniierslty Consortium in the fridtf ecological data fbvaine Uarrkk 

and his CO ivorker Jane Bayes present infotmatton on the nojs m M ecofogiccl 
data from o^aal sourccs'bart beat linked enntecaially uith survey data in the 
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nork earned forward at one oj the most aeiisc of the local archives in the United 
Stales- the Falk archive at the Unnersit^ of California, Los Angeles The next t*o 
chapters review experiences and discuss strategies in the organization of ^szematic 
fles of ecological information for tno countries with strong traditions of official 
bureaucratic bookkeeping. Germany and Austria. The chapter by Hartenstein and 
Liepeh IS of particular importance because of their emphasis on the linkages of 
ecological data archiving nith urban ami regional planning This points to 
an exciting convergence in current trends tf research: the social or political 
ecologistfocusing on local sanations within the total national territory finds himself 
increasingly attraaed bj the stjle of model construction and trend extrapolation 
characteristic of the physical planner and the regional scientist. There are im- 
portant diferences in the choice of territorial unitr, but the variables covered and 
the stjle of anaijsis tend to become increaiinglj similar. Most sociologists or 
political scientists who hate made fforts to develop ecological data fles have felt 
bound to stick to the smallest units ^ local administration, at least in the countrj- 
side, in the cities the tendeny hat been to focus on whatever is the equivalent of 
the census tract. To geographers and planners the interesting variations tend to 
occur ct er<n lower levels of territorial aggregation the Swedes are establishing 
a grid co-ordinate ystem for plotting census data, population movements, land use 
information, etc ,the Sorwegians, the British, and the Americans ere expenmenting 
with similar stems, and the Germans hope to build up a block ^stm with identi- 
fying co-ordinates. This isan exciting frontier of interdisctplinaij research todaj — 
again a feld where the International Social Science Council might serve an impor- 
tant function in linking national ^orts and spreading technical know how from 
the inwFctjn^ to the Jess advanced centers The fnal chapter, by the Argentinian 
sociologist Jorge Garcta-Bouza, helps to identif some of the important problems 
in extending ike archival movement to the developing countries of Latin America. 
The International Social Science Council has been very much interested in promoting 
archival developments in all countries and regions where there are si^cient backlogs 
of ecological data worth reanaljsis- major conferences on regional disparities and 
local variations in growxh rates have been held in Buenos Aires in 1964 and m 
L'lewr Delhi in 1 967 and will be followed up through concrete studies of the available 
data resources under contracts with USE5CO. 
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The Development of Archives for 
Social Science Data 
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Organizing an extended set of election returns or consus reports for 
subsequent work with a computer m-iy well be prohibitively expenswe 
if a single piece of research must bear the entire cost. The demand for 
data resources of a scope and a quality commensurate with long- 
standing intellectual ambitions has grown as the creation of such 
resources has become technically feasible Financial resources necessary 
to support significant social research have not grown at a comparable 
rate Scarcity of funds to bear the cost of creating data resources has 
thus added impetus to the extended sharing of those data that do exist 
Out of the need for data, the high costs, and the technical feasibility of 
sharing data has come the concept of a social science data archive 


RESEARCH CENTERS AS SITES TOR 
ARCHIVAL DEVELOPMENT 

The history of modern archives of machine-readable social science 
data IS so brief as to challenge immediate attempts at generalized 
description or commentary Interest in these innovations by scholarly 
organizations has swelled astoundingly, but the number of such archives 
m actual operation is as yet very limited Although we may envision a 
world-wide community of scholars and teachers sharing the world s 
entire store of data, our knowledge of the archival enterprise rests 
on our limited experience with the handful of archives actually 
existing 

From at least one perspective this is a curiously impoverished state of 
affairs The essential components of an archive are few m number, only 
a subset of the components of a self-sufficient research installation and 
not substantially different from those of a single large research project 
Indeed, more than one presently recognized archive came easily into 
being (if not into operation) with no addition of equipment or personnel 
beyond the complement already assembled for research purposes Given 
a collection of data needed by distant colleagues and given a desire to 
respond to their requests for assistance, an internally oriented research 
operation can rapidly acquire the perspectives of an externally oriented 
data archive If the research operation has been supported by an 
appropriately organized computer installation, the researcher’s com- 
mitment to collegial CO operation may be stimulated by associates who 
appreciate the potential role of the computer in carrying out the tasks 
of data storage, processing, and retrieval so essential to an efficient data 
service organization The tradition of intellectual exchange within an 
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FOUNDATIONS OF ARCHIVAL GROWTH 

Although research scholars provide the impetus for archival develop- 
ment, the> are reluctant to accept responsibiht> for specific archival 
activities Unless the creation of an archne promises to be the onl> way 
of making a valued piece of research possible, most researchers are 
profoundly uninterested in abandoning or diminishing their own work, 
even temporarily, in order to become research administrators for other 
people's research In some waysihisisas it should be The scholar whose 
foresight and imagination permit him to be excited by the promise of 
archival development should remain unencumbered and free to devote 
his imagination to scholarship And yet if the individual decision always 
favors the individual’s immediate work, the collective result is a failure 
of leadership It is hard to fault any single decision that has left a 
potential for archival development unrealized, but the cumulative effect 
thus far has been to limit the number of places where interest in archival 
development has passed beyond informal conversation and exhortatory 
discussion 

In quite another way, reliance on a central role for the established 
researcher provides a second limitation on establishing an archive 
There are relatively few instances in which an ongoing program of 
research provides more than a beginning for an archive Unless the 
research program is so broad as to be relevant to hundreds of remote 
scholars and students, it will most probably not justify a major capital 
investment m the development of archival facilities Consequently, a 
serious commitment to the task of creating an archive usually requires 
a concern with data and facilities far beyond one’s own research in- 
terests The voluntary extension into domains of reduced competence 
and concern dilutes both enthusiasm and viability The transition from 
accumulating one’s own research resources to making those resources 
more widely available may be an easy, natural evolution guided by an 
intimate knowledge of relevant professional and technical considera- 
tions, but if the new archive is to be maintained, permitted to take 
advantage of new opportunities as they arise, and organized to respond 
to new demands as they arc made, the transition carries with it clear 
implications for a continuing and growing involvement in a set of 
problems increasingly remote from the point of departure, the concern 
for one’s own research 

The problems of archival administration may be fairly put in a single 
question What support is available to develop what organizations of 
what sets of data to be acquired by whom from whom for what use by 
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the archive is uniquely concerned with using the best administrative and 
technological processes to enhance the value of data to a remote universe 
of potential users 

A preoccupation with the wants and abilities of others certainly 
should not divorce archival management from a deep concern with the 
frontiers of research and creative thought Indeed, the future of archival 
development rests on the probability that first-rate scholars dedicated 
to professional and disciplinary excellence will recognize the strategic 
and crucial role the data archive will play in the future definition of 
intellectual concerns Nevertheless, the responsibility for making wise 
use of scarce resources to facilitate other people’s pursuit of selected 
lines of inquiry is not the same as making an independent decision as to 
one’s own pursuit of research objectives 
The game of anticipating the future uses of data is by nature a per- 
plexing game with many sets of rules Wide experience and a certain 
rapport with the pioneers scouting the frontiers of research may provide 
insight into the substantive imperatives that should be allowed to shape 
the research of the future On a somewhat less exalted plane, the decision 
to select some data rather than others, or to develop this capability for 
processing rather than that, may be limited if not controlled by bound- 
anes established by fund granting agencies , and the entire enterprise of 
archival development is so new that none of the large sources of 
financial support is really well prepared to make the independent decis- 
ions that Will enhance one archival development and its lines of inquiry 
rather than another 

Whatever choices are now open, it seems certain that today’s deasions 
will soon be judged by the reasonable criterion of tomorrow’s use It is 
hard to quarrel with the presumption that archives now absorbing 
hundreds of thousands of dollars in developmental funds should shortly 
be able to point to substantial quantities of first rate research that could 
not have existed without the capital investment in archival resources 
Of course, a well-established research tradition that can profit from a 
fuller, richer data base ma>, m one breath, promise an archive and its 
sponsor rich dividends, a safe investment, and steady returns over the 
>cars On the other hand, the implicit reference to nsk capital may be 
sufficiently apt to promise a mixed reward to the archival entrepreneur 
and a paucity of funds for any but the most assured venture in creating 
new research resources 

The necessary ubiquity of concern with a distant and dispersed 
^iverse of scholars who are an archive’s constituency and reason for 
being further exacerbates an administrative problem common to almost 
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users The sheer existence of a set of data does not give it value, the 
existence of an elegant computer routine does not mean it will be used 
Ever> aspect of an archive must be fashioned wiili a keen sense of the 
purpose and probability of its use Until the resources available to social 
science are seseral magnitudes greater than they are now, it will be 
irresponsible to do any less 


ECOLOGICAL DATA AND SOCIAL SCIENCE ARCHIVES 

The archive specializing in aggregative or ecological data seems in no 
way sharply different from the archive of survey data There are, never- 
theless, some differences of degree that arc worth noting For example 
the strategy of acquisition for an archive of survey data should be some- 
what more straightforward now and for the immediate future Although 
very large quantities of data are now being generated by survey methods, 
only a small fraction promises to be of more than limited interest to the 
academic community In the absence of any use-onented criteria of 
value, there is a sufficient flood of survey, poll, and market research 
data to inundate a score of archives, but with even the most generous 
definition of probable value across the social science disciplines, the 
more pressing problem of the moment is actually the need for more and 
belter collections of data Moreover, even if other considerations dictate 
a rather indiscriminate acquisition of data of doubtful worth, the dis 
ciplined archive need process only selected collections for widespread 
use by its constituency 

There would seem to be no such relief in sight for the ecological 
archival effort In country after country ecological data have been 
produced for scores of years, not simply during the past twenty years 
There is little duplication among data collections, so the bulk cannot be 
reduced by selecting the better data — or by allowing an academic 
version of an inverse Gresham’s Law to operate and wait for the good 
data to eliminate the demand for the bad Moreover, the mass of 
quantified or quantifiable data in the public record — census data, 
election returns, governmental statistics legislative records and the 
hkc often constitutes the single authoritative source of needed 
information 

A further definition of the necessary scope of ecological archival 
development is proMded by the expanding set of probable users Withm 
a very few >ears the histonan will join the sociologist and political 
scientist m force as an avid customer for archival goods His move- 
ment into the ranks of the consumers of data will, however, only 
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developmental investment, investment m data organized in an inte- 
grated data base is a larger gamble than is investment in facilitating 
established research routines 

A third more or less unique problem of the ecological archive resides 
in the data source A community of survey archives can be expected to 
exert significant influence on the population of data suppliers, par- 
ticularly the academic researchers whose data arc ultimately destined 
to be widely distributed for direct use by colleagues Standardization 
of method and technique and a continued upgrading of the quality and 
utility of data can be expected, at least over time, as interaction among 
research scholars is promoted by shared interests in the archival pooling 
of data Commercial suppliers of poll and market research data may be 
less susceptible to influence, but even there a growing set of common 
concerns bodes well for the future The ecological archive, on the other 
hand, must usually deal with governmental agencies whose policy 
makers are more removed and whose operations are more securely 
established in a complex bureaucracy Although the social bookkeeping 
of many nations provides a rich source of data for social research, the 
ability of the archival community to shape the generation of future data 
must be limited at best Indeed, it seems that initiation of ecological 
archive development finds the would-be archive more often in the 
position of supplicant, forced to demonstrate that national interests 
or agency interests will not be jeopardized by wider dissemination of 
data In many instances, a variety of well-established policies relating to 
confidentiality for respondents who provided the original information 
sharply limits the access of the academic researcher At least in the 
American context, many important contributions to ecological archival 
collections will become available only after the ponderous machinery 
of the data-gathcring bureaucracies moves to more amiable postures of 
co-operation with the academic community 

As something of an innovation m social organization, the social 
science data archive is a part of the modern world but a challenge to 
many of its institutions Where a scholar's capital once was measured in 
part by hts access to information, it is now about to be measured more 
completely by his ability to use information Where a university once 
became great by commanding the resources of scholarship, it may gain 
comparable recognition tomorrow by its ability to share those resources 
with sister institutions The implications of the co-operative, inter- 
institutional, interdisciplinary network of social science data archives 
arc extremely challenging The realization of the idea, however, depends 
on the solution of many problems The excitement that now grips the 
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few \\ho would create the archives must spread to the potential users 
the suppliers the financiers and the administrators ivithin whose baili 
Wicks (he archives must find support AH of these contributors must 
move apace to a sophisticated understanding of the optimum relation 
ships between them and this new inhabitant of the world of scholarly 
endeavor 
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research objectives in view, this is quite appropriate, for secondary 
analysis as part of a complex data base, the survey materials thus 
gathered loo often lack the kinds of variables needed to fit a respondent 
into any quas!*realistic “domain,” much less put him back into the 
actual context in which he lives 

The need to reintroduce context, sequence, and interaction as system* 
atically considered dimensions of our research designs docs not mean 
diverting attention from the subjective attitudinal dimensions empha- 
sized by survey research It means use of aggregate data — census 
materials, election returns, transaction statistics — to characterize the 
changing environment at the same time that it means use of records 
about individuals seen as participants m organized processes It means 
that we need to be able to handle multiple data files 

Modem computers, with their capabilities for manipulating and 
transforming data, are well suited for modeling and simulation work 
and show promise for radically changing the ways in which social 
scientists use evidence We need experimental modeling, on a large- 
scale basis, of political and social processes Whole families of process 
models need to be compared m operation, using the same data base but 
varying the branching and decisional premises The software capabilities 
currently exist, and the next generation of computer hardware should 
make data manipulations and simultaneous documentation of data 
transformations even easier 

What is also needed might be called computer calisthenics — enough 
planned exercise using the available configurations to acquire new 
habits of inquiry In looking at the available data base for studying a 
social science problem, we need lo be sensitive to at least four questions 
that have not routinely arisen heretofore (1) On what terms of ref- 
erence can two data files be interrelated'^ (2) What is the lowest 
possible level of “disaggregation”*^ (3) How can unambiguous identi- 
fication of the various entities to which data records refer be main- 
tained (4) By what stages can interrelated data files be mobilized along 
time dimensions'^ In any work that seeks to make simulations or models 
of political processes, these problems probably will be handled in 
awkward ways for a long time 

For Los Angeles political behavior in recent years, the availability 
of both survey and ecological data files on comparable matters has 
permitted a number of ad Auc explorations of how they supplement one 
another as sources of evidence about the same political phenomena 
After some further elaboration of the points just summarized, this paper 
will describe some results of those explorations 



535 


IWMAINS AND UNIVERSES 

the three generations in THE mrORY OF 
quantitative rOUTICAL ANALYSIS 

For certain purposes to te'' >>'“ 

systematic inquiry into ^ ,vith atliludmal dew 

preoccupied with and inleraclion processes The 

minants and the third with st^uen^ , , 

first relied heavily „se simullaneonsly more Ihan 

materials and the third is attempting 

one file of data relevant for ® „„p|,fied by men lile Harold 

The first generation « P'tt'^P' ThL work stressed empj ca 

Gosnell Andre Siegfried or V O |„s,o,y a»d po^ 

patterns and trends that .acntilicd persistent e”®”', , , 

of particular communities y showed bow a 

of cleavage traced jjj by national P"''”' 

partisan ramparts ,^ l beliefs and the transformati 

Lined the erosion of f';^„^ern technology Thev •'”« 
concerns by mass medi j , ,„,„anls they ,ii,c factors 

drawn and “OP'*"®* ,eece qmte acceptable . slatis 

political patterns " L p„*at. census d“'“ ^le Gradually 

During their period ^^ otative evidence a ec- 

tics were the staple kinds prevail abou ^ Robinson 

a level of disenchantment ca^<»P 1950 „r ,„ch 

sources could be conned into r^^l^ ^ ^„,„„eal f»"’^,„„,e to the 

published his well k"'’*" ” ‘ jechned Seen « “ Liosimuchof 
data had already “Pf’^'^f^Legatedaiafl's'’'' p„silic'"s“'>""' 

Its appeal to political sc scnsilisuy m co 

political behavior much of ibis e 'r- 

second generation ’‘'“tLenenrh'’" " t ud« I 

lent The postwar u" Lplesurse> 

contributions to eb technique - tb' “ „„,|ool ‘ ^ 

There was available a j,„en5ions of P" ,j„„g u'‘’“"Lnira 

probed hitherto inacccs pjeoccup^^'^f nucstop«'’'‘'^^ ^ 

precision Metbodologien P« „a”ei the «'« 

Unngprobl=ms.n ^;;;.Vga.es dominated 

tions elites and other n 



536 


DWAINC MARVlCk AND JANE H BAYES 


Research was by no means cast in a single mold, but the concern with 
sampling methods was prominent Paul Lazarsfcld and his Columbia 
associates cmplo>ed community panel studies Angus Campbell and his 
Michigan colleagues conducted national surseys Heinz Eulau and his 
associates probed the lives of state legislators Ithicl Pool. Herbert 
McClosky, Samuel Eldcrsvcld, and others studied group leadership 
patterns Ho^\e^er dilTercnt their contributions in focus and formula, a 
salient feature of this generation of scholars has been ihcir willingness 
to confine research to the perspective secured by using a sampled 
universe 

The apprenticeship of social science to survey methodology during 
this second generation period has created varied and voluminous data 
files, which are now sufficient to permit useful secondary analyses, 
historical trend inquiries, and simulations based on composite samples 
The third generation is adding some distinctive components to the data 
base A pragmatic (rather than doctrinaire) test of what constitutes an 
acceptable degree of data gathenng rigor seems increasingly to be 
employed, as field research isextcnded into underdeveloped areas, legisla 
tive arenas, top policy-making processes opinion formation situations, 
or various intramural organizational settings It is recognized that 
demanding the best sample, the most neutral interviewing and the most 
carefully standardized questions might pose ideals that could frustrate 
purposeful inquiries A complementary, rather than exclusive, attitude 
toward the relevance of documentary evidence and quantitative data is 
emerging, as the polemics over the behavioral persuasion cease to 
interest, or even frighten, members of the political science fraternity 
Finally, a developmental, rather than static, pattern of inquiry is in- 
creasingly pursued The agenda for research gives high priority to work 
that outstrips available data resources The kinds of data available 
simply do not begin to meet the data requirements of system models 
These trends m current research methodology arc linked to the grow- 
ing attention to research problems arising when multiple data files are 
relevant For a given entity, multiple data files might include opinion 
data, census material, voting figures, biographical statistics, tax levels, 
case loads, budget allocations, content analyses sociometnc rosters, and 
so on The research ideal is to formulate problems that call for the con- 
certed mobilization of these evidential resources rather than the arti- 
faclual elaboration of seemingly autonomous planes of political life, 
each discoNcrable in its own peculiar way — opinions by polling inter- 
actions by observation, trends by transaction levels recruitment by life 
histones Much attention is given to modeling and simulation patterns 



domains and universes 


537 


Common today arc ”f hardwm'Iad sortware- 

figurations of machincty and manipniation of 

svtnch w,ll bo b,g cnoagh to ■" an bo tpec.^ qu.te 

numerous fdos Some '■'^“'""’"'^"fd^rceordsand files »b, chare 
easily Au,o.,.ncsclf-laM,.«<<(^^«^";^^ ,r desirable 

generated at intermediate *“5“ ^ and variables in each record 

Updated inventories nP i ^ preferably by a renm/dicnonory 

ofthe relevant files need to available data base at inter- 

able to provide on-line multistage analysis Even 

S^etv^ous^tatisn.^"^^^^ 

provide Ihe csscnlta . ? wilh ihc us« ® ^ retneval 

Lenee ,s experience and 

people hi"'' ‘"'"phan concerted useofmultpl^f eempi, cations and 

problems that a"*' ^^d to mo'C “n “ debugged and ready for 

Those who have *’ ^5 software pac“ 8 ' ' ^ aradable concerted- 

challenges as soon as p „„ is made computer 

use Too little aetua „f hov» » W ^ d only 

analysis systems yet tb' pmblems is IikeU 

resources to a calistbemes 

by a regimen of compute 

OF SURVEY SAMFUNG 

the ECOEOalCAE BASI 

—I. involved m survey ^ present 


A review of the . ,3 files op'fions containing an m 

llssion of rnumP^^^’prerview questions Taken as 

Selection o ^ „vani evidence 
exposure to ih^ 



538 


DWAINE MARVICk AND JANC 11 BAYES 


sampling, the first task is to define the uniscrse in terms that make 
possible the gathering of empirical cases To sample a universe of 
disembodied ideas, actions, or characteristics is not possible, such 
things must be understood as aspects of human behavior. A universe 
must be reduced to people, groups, events, or records In short, it must 
consist of tangible and discrete elements Second, the universe to be 
sampled must be finite, even if arbitrarily so The time span to be 
covered must be fixed Place boundaries must be drawn Channels of 
communication between researchers and the universe to be sampled 
must be devised Third, the basic sampling units which constitute the 
tangible elements of a finite universe must be designated One may wish 
to consider individual people, family or other group units, ten-minute 
periods of group discussion, or one-hour periods of observations to be 
the basic sampling units In any case, a firm decision on this point also 
IS necessary before a sample can be designed 
A sample is only a part — any part — of the universe in question 
To get a representative sample, resting one's claim upon the impressive 
empirical substantiation of probability theory as used by modern science, 
one must seek to give every element in the universe the same chance of 
selection as every other clement A “representative sample" is a part of 
the “universe in question," so gathered that, according to probabilit) 
theory, the likelihood is maximized that statements of a given precision 
about ifieuni\erseas a whole based on sample information may be made 
with a known degree of confidence 

Such surve> procedure when applied in social science inquiries 
presents at least two difficulties It causes neglect of certain variables, 
notably those of context, and it fosters illusions about the distinctive- 
ness of survey data which arc not warranted by the way such data 
are currently analyzed 

First, people live m organized settings or “domains" within the 
larger universe sampled In the individual's environment, interaction 
and sequence are often known (o be coercive determinants of conduct 
Yet these considerations are attenuated m survey research Individuals 
are treated as the basic universe In voting studies, those interviewed m a 
cross-section survey live in different communities and neighborhoods 
They are stimulated by dissimilar campaign developments, despite mass- 
media communications They face election day ‘choices” that arc cast 
in particular terms by the conscious campaign efforts of local organized 
rival party groups — each trying to tailor the electioneering to the 
audience immediately at hand Of course, when individual voters are 
chosen for inclusion in a sample, it is possible to ensure appropriate 
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one of Its aspects — that more dinTcrence within categories than be- 
tween categories may prevail. As such it is not specifically a problem for 
ecological data, however troublesome it is to deal with such data and 
however sensitive about its relevance most anal>sts of aggregate data 
have become. 

Covariance problems are pervasive also in mullivariant analysis of 
survey data. Statistical developments in social science have begun to 
provide appropriate tests and procedures for evaluating categorical 
variables. Almost all cross-tabulation methods start by assigning in- 
dividual respondents to categories. When the data warrant, it seems 
appropriate to claim, for example, that poorly educated conservatives 
tended to vote for Goldwater in larger proportions than did well- 
educated conservatives. A little reflection will remind us that this 
statement is based on a comparison of four-cell entities in a concept 
space: 

For Goldwater NolforGoIdwatcr 

Well-educated conservatives 

Poorly educated conservatives 

Familiar questions arise. What other variables might be “masked” by 
education? Is the set of people here grouped together, namely “con- 
servatives,” really a proxy for an overlapping set — for example, 
Republicans? Is it really Goldwater, the man, for whom those people 
vote, or would they have voted for any alternative to Johnson appearing 
on the ballot? As one controls for the additional factors, the number of 
cases gets smaller. The idiosyncrasies of the individual respondents 
become troublesomely apparent, TTicre is irony here. No doubt, it is 
useful and feasible to investigate empirically the warrant for all those 
tractable propositions about what “really” might be causing the data 
distributions in question that lend themselves to multivariant testing. 
But it is odd that data records that pertain to individuals are sanctified 
by that fact, although the way in which they are made to yield findings 
is often by treating them in multivariant categoncal sets An ecological 
domain is a useful concept when the variance within the geographic 
entity is no larger than the diflerenix between comparable ecological 
areas. Many analyses of survey data group individuals into categories 
on the basis of multivariant conceptual indices that are as subject to the 
pitfalls of an ecological fallacy” as are geographic indices. Those who 
seek to compare such ecological and multivariant categories nsk coming 
to simplistic conclusions about complex phenomena. By focusing on 
process ’ m time and space, ecological and multivariant categones 
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extent to which one has a basis on which one can interrelate facts about 
an individual to facts about his family, residential block, school district, 
the factory where he works, or the central business area where he shops 
Asking questions about comparability leads to the second problem 
What level of concreteness, of specific character, is the best possible 
level, if a complex data base is given^ What is the lowest feasible level 
of disaggregation that we are able to maintain'^ Data are available for 
land parcels, for blocks, and, of course, for much larger entities Where 
interview materials exist or where statistics about individual per- 
formance can be found in public records, it is possible also tocxplorea 
problem in a sustained way in terms of individual persons 
Another aspect is location One must develop methods of identifying 
geographic entities unambiguously and of specifying the uses made of 
structures and channels found in such entities, whether reference is to 
particular parcels of land or to areas of census tract size This means 
that one must grapple with the problem of how to locate clearly — 
through time — which particular geographic area one has in mind The 
complex and overlapping maze of political boundaries that surrounds a 
citizen consists of areal units that are ambiguously delineated, arbitrarily 
named, and distressingly impermanent Yet this problem must be 
mastered as a prerequisite to successful computer-aided manipulation of 
multiple data files Our Los Angeles experience indicates the feasibility 
of marrying demographic and electoral statistics by fitting precinct 
boundaries and allocating parts of precincts to different tracts when 
necessary In the future, the possibility of using locational co-ordinates 
seems attractive Electronic equipment presently exists with which to 
read into the computer numbers that refer to any area’s quadrant 
position on a grid and which represent the latitude and longitude of 
points on Its boundary 

Finally, the introduction of sequence considerations and time 
dimensions raises additional difiicultics The problem involves not only 
locating clearly whatever area one is analyzing but also studying the 
sequence of movement, the flow of people, the flow of resources, and 
the flow of information m that area It involves doing so on different 
eve s of generality, in terms of shifts from one party affiliation to 
another from one legislative district to another, reportable for in- 
dividuals, groups, and area aggregates that are m “nested” relations to 
one another, and doing so on a monthly, yearly, or longer time period 
*here are “flows” of all measures and sizes that must be 
studied by anyone interested m the information base on which a political 
or social process draws 
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politics characterizing the members of the Democratic and Republican 
Party Central Committees of Los Angeles County (1962) 

At the citizen level, several opinion surveys have been undertaken 
These include (n) a 1960 county-wide interview study of the political 
leadership within the Mcxican-Amcrican community in Los Angeles, 
(6) a similar 1962 study within the Los Angeles Negro community, (c) 
a county wide interview and questionnaire survey concerned with civil 
liberties and civil rights problems linked to the November 1964 election 
day choices (autumn 1964), and a similar county-wide interview and 
questionnaire survey concerned with political participation and 
socialization patterns (summer 1965) 

The availability of such comparable survey materials, census data, and 
electoral statistics about Los Angeles County enables the integration of 
disparate data files for analytical purposes This can be extremely 
advantageous Survey data taps altitudinal, socioeconomic, and 
political information in considerable detail and on an individual basis 
Electoral and demographic data compiled by census tract arc accurate 
and comprehensive in coverage but cannot provide the substantive 
or the subjective “depth” available in survey materials Combining 
survey with aggregate materials should allow an investigator to 
emphasize the advantages and minimize the disadvantages of each 
type of data 


One way to illustrate the significance of context is to focus on political 
process within a type of geographic domain The particular process of 
interest here is that of democratic citizen participation in political and 
civic affairs To think of participation as a process is to include within 
the concept environmental, altitudinal, and social interaction patterns 
Political participation then can be viewed as the process that relates the 
individual to a corporate unit If such a definition is useful, then the 
type and intensity of participation mode should be related to the 
attitudes and interaction patterns linking the individual to any specified 
corporate entity The following analysis attempts to identify and in- 
vestigate the nature of such a relationship with specific reference to local 
governments as the corporate entities in question 

data base for this excrase includes political and socioeconomic 
information for a random 5 per cent sample of the 1 ,300 census tracts 
m Los Angeles County This information is available on the POLCEN 
file described earlier In addition, this analysis draws upon 700 inter- 
views obtained on a randomly selected basis withm the same 5 per cent 
tracts (It should be noted that the census matenal is 
that 01 1960 The electoral statistics include the 1958, 1960, 1962, and 
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Stability scores (over 25 per cent stable) The correlation coefficient for 
the relationship between education and the average TVC ratio is very 
similar t— 5349 with a significance level of cr = 137, Z= 589 
Income also correlates with the TVC ratio to about the same extent as 
stability and education level The correlation coefficient and the 
significance level are r— 587»a= 137,Z= 629 A plot of education 
versus income shows that these two factors are highly correlated with 
each other, as one might expect The correlation coefficient and the 
significance level are r = 76, a = 137, Z = 87 However, when 
education is plotted against stability, the resulting scatter of points does 
not suggest a regression line The same is true for a plot of stability 
versus income level The conclusion is that stability is independent and 
unrelated in any simple or direct way to education or to income with 
respect to the average TVC ratio Furthermore, it is clear that education 
and income levels both vary in their relationship to the average TVC 
ratio 

The figures for stability and either education or income can be used to 
predict, with fair accuracy, the average TVC ratio for any tract 
The multiple linear regression formula for this prediction is* 

y, = 47 44 + ( 2495)Jir, + (1 43)^2 + ( OOOOS)^'^ 

where Yg is the calculated average TVC ratio for the 1958, 1960, and 
1962 elections m any given census tract in Los Angeles County, Xi is the 
ratio of those living m the same house in 1 955 as in 1 960 to the number 
of people over five jears of age in 1960, for the given census tract, X 2 is 
the median education level in years of school completed for the census 
tract, taken from the 1960 census, and Xy is the median family income 
for the giv en census tract 

This regression formula was computed with data for the 5 per cent 
random sample of tracts used as the sampling base for anopimon survey 
in the summer of 1965 With this formula, the calculated average TVC 
ratio varied by i8 points for 91 per «nt of the tracts when compared 
with the actual average TVC ratios When applied lozdjfferent 5 per cent 
random sample of Los Angeles census tracts, the formula was accurate 
m predicting the average TVC ratio withm ±8 points for 97 per cent 
of the census tracts Consequently, for a given education and income 
level, the average TVC ratio can bca very accurate index of geographic 
stability wiihm a given census tract 
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may have been the same as two years before, or it may have decreased 
Changes m the TVC ratio will not indicate which 
If we concentrate on increases in voting turnout from one election to 
the next, we could expect increases m all tracts when measuring the 
difference between off-year and Presidential year TVC ratios Immigra- 
tion cannot be adduced to explain such increases Heightened interest 
m the election day choices would seem to be the principal factor 
determining the higher TVC score in a Presidential year 

Empirically, not all tracts exhibit such differences in electoral partici- 
pation using this index of change About 30 per cent of the Los Angeles 
sample survey tracts varied hardly at all in TVC ratio levels for con- 
secutive elections These we refer to as “steady state” political domains 
At the other extreme, another 27 per cent showed increases of 10 or more 
percentage points for national over off-year election TVC ratios These 
v.e call “vacillating” political domains Between these extremes fall the 
rest of the Los Angeles sample survey tracts, with TVC ratio increases 
ranging from 2 to 9 percentage points (Electoral statistics were not 
available for 1 1 of the original 67 census tracts ) 

The TVC ratio derives from election statistics, generated for census 
tract units and spanning the period 1958 through 1964 As such, it 
serves as a bridge linking survey and census materials First, the 1965 
opinion survey results can be directly examined m light of 1960 demo- 
graphic facts about the sampled tracts Second, it is possible to interpret 
1965 opinion patterns by reference to the 1958-1964 electoral change 
patterns of the tract areas Because of the previously noted correlation 
between the electoral behavior of tract units and such I960 census 
variables as residential stability, income level, and educational level, it is 
possible to make such comparisons and feel that any gross demographic 
changes making the I960 census figures unreliable m 1965 have been 
adequately controlled by the TVC ratio classification Basic ecological 
characterizations of the three types of “political domain” are presented 
in Table 1 

Interviews conducted in 1965, then, can be grouped according to the 
“political domain” m which the respondent lives When this is done, our 
1965 respondents are distributed as shown in Table 2 In comparing the 
three domains, note first that the population distributions by sex, age, 
and residential stability m each category are practically the same More 
definite contrasts emerge when “steady state” and “vacillating” 
domains arc examined on other counts First, the proportion with at 
least some college education decreases from 55 per cent in “steady state” 
to 37 per cent m “vacillating” domains Similarly, the proportion 
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engaged m white-collar occupations declines from 57 to 40 per cent, 
and the proportion with high incomes declines from 44 to 21 per cent 
These economic and educational differences that were obtained by 
using survey materials correspond well to the domain descriptions 
according to 1960 census data displayed in Table 1 The population 
proportions in “steady slate” and “vacillating” domains decline on two 
additional points (1) the proportion married decreases from 82 to 65 
per cent, which is an unusually sharp contrast to find on this variable, 
(2) the proportion of home owners falls even more precipitously, from 
79 to 39 per cent 

These differentials may suggest analyses in terms of what social forces 
sustain different patterns of participation Our concern here is with the 
attitudes and political participation patlcrns associated with residence 
in domains having markedly different levels of continuity m electoral 
participation More specifically, what kinds of attitudes toward local 
political and civic affairs characterize individuals living m “steady stale” 
as opposed to those residing in “vacillating” domains'^ A probing of 
this question illustrates the possibilities available in the use of process 
as a focus for analysis Table 3 uses education as the test variable 
The first general observation concerning Table 3 is that the college- 
educated group m each category give very similar responses, indicating 
that domain makes little difference m their attitudes toward local affairs 
Second, the non college-educated group is not homogeneous m this 
sense, and the varying domains seem directly relevant 

More specifically, comparing people of different education levels 
living in the same type of domain suggests the importance of the political 
context Consider first the “steady state” domain On count after count, 
similar proportions of those with no college and those with college 
education said that (1) they understood local issues fairly well, (2) 
they have sometimes tried to influence local government, (3) sometimes 
they might try to influence a local decision, and (4) they have felt that 
local police give them serious attention Finally, m considering how to 
go about influencing a local government decision, those who said they 
would rely on the vote exhibit some educational differences, but the 
proportion who would turn to other individual modes of action is 
comparable 

Perhaps the most interesting finding to emerge is the extent to which 
educated people living m “vacillating” political domains mention group 

cfrortsasappropnatefonmplementmgtheir political goals In “vacillat- 
ing domains, nearly three times as many respondents with college 
education (56 per cent) as with no college education (20 per cent) 
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indicated that they would try to use group methods to influence local 
ofiicials The evidence in these data files cannot tell us whether this 
contrast reflects a lack of mutual neighborhood involvement or whether 
It reflects leadership among the educated in creating organizational 
structures presumably needed to compensate for or modify the character 
of the political domain itself 

Consider next the impact of a “vacillating** political domain While 
well-educated people are apparently little alTected by it, their neighbors 
with limited education respond to the ambiguous political scene by ex- 
hibiting discouragement and pessimism Asked whether they under- 
stood local issues, onI> 48 per cent gave a positive response, compared 
with 69 per cent for their well-educated counterparts On two other 
questions, the levels of positive oncntation to local affairs indicated by 
those with limited education were more than thirty percentage points 
below those registered by their more fully educated neighbors 


SUMMARY 

Initially, this research began with precinct information, census tract 
information, and interview materials gathered m a set of primary sam- 
pling umts (psu’s) The task of rcndenngprecincts, tracts, and psu’seom- 
parable could not be avoided by any enterpnse seeking to use these 
different sources of evidence to study a common problem We have been 
concerned with the importance of being able to make concerted use of 
multiple data files to analyze political processes Furthermore, we have 
tried to explore some of the intellectual puzzles involved in relating 
disparate data files to each other Our focus has not been primarily on 
opinion data patterns, nor have we tried to classify respondents by 
census criteria Rather our method has been to use electoral statistical 
patterns to classify people in terms of the political domains in which 
they participate 

Our findings show that educated people seem to be liberated from 
their environment The neighborhood process characterized by vacillat- 
ing electoral participation does not seem to affect the attitudes of 
educated people tow ard local political life People with limited education, 
however, are considerably more vulnerable to the political processes 
that charactenze their domain In * stead> state” domains people of 
limited education on man> counts express attitudes similar to their 
belter-educated neighbors This suggests that the political processes m 
these domains exhibit substantial integrative components On the other 
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hand, when people of limited education arc found in ‘Sacillatins” 
domains, they often seem to lack both the ability and the will to par- 
ticipate m the local political process 
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INTERESTS 

As our concern is with ecological data and the kinds of uses to be 
expected from the systematic collection, storage, and manipulation of 
such data, there are three mam types of interests involved here 

1 The regional scientist in a broad sense of the term, that is, the 
person engaged in the study of spatial phenomena, be it with emphasis 
on sociology, sociography, demography, economy, psychology, or 
political science 

2 The producer of ecological data, that is, of information about 
people and their behavior, on the one hand, and the physical setting in 
which It takes place, on theother, and of information that can be located, 
that belongs to a place or moves between places 

3 The person who is professionally engaged m the manipulation of 
the physical environment, particularly the urban and regional planner 
and the local decision maker 

The relationship between these three professions has not been an 
altogether happy one in the past Space, compared with time, has been a 
variable widely neglected in social and economic research The inter- 
relation between spatial and social structure, although of high relevance 
for the physical planner, has hardly been explored 

1 Social scientists have been only slightly interested in the physical 
environment or m the practical implications of their general findings 
for the planner, they have shown a strong preference for either surveys 
or census data and only recently have become aware of an approach that 
integrates the two 

2 Data producers have preferred the macro level and have not 
bothered with small units, partly because nobody asked for such in- 
formation and partly because the handling of such large masses of data 
prov ed to be difficult 

3 Planners, with their typical background as designers, deal with 
the physical aspects of the city in their daily work but have difficulties 
m understanding changes and relationships they cannot see or draw 

kind of questions they put to the social scientist are not exactly 
those he has a ready answer for 

To be sure, much work and time and money have been invested m 
applied research during the last decade But the practical and theoretical 
effert had been limited, for reasons that cannot be dealt with here 
Lack of data has not been the primary problem On the contrary, 
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little IS available for the prewar periods In most cases, the analyst has to 
rely on published figures, working with the raw data or with cards or 
tapes is still the exception, partly for reasons of secrecy * Because the 
data are kept, processed, and published at the eleven Stale Census 
Offices, and not always in an identical fashion, comparative research is 
not an easy job Many questions cannot be answered with aggregated 
data for communities, for the time being, however, work at a level of 
aggregation below that is extremely difficult And finally, only informa- 
tion that is part of the census programs or other official statistical 
activities can be found at the census offices So far, no scheme has been 
developed to integrate materials from different sources 
Some of these problems could be solved by a research-onented data 
center apart from or within the official agencies, but most of them would 
not DATUM has sizable data collections (cards and tapes) on the com- 
munity level, mostly from the 1961 Census Its approach, however, is 
not to compete with existing institutions but to fill obvious gaps As 
some of the limitations will be overcome in the near future, its activities 
are directed toward the 1970 s rather than the 1960’s Therefore, it is 
worth having a brief look at the trends 

TJi£JVDS 

The most relevant changes in our context have to do with the 
techniques and tactics of data production, data handling, and data 
uUUxation 


Data Production 

The General Census of 1970 is likely to differ from earlier ones in a 
number of ways 

1 The enumeration district used to be an oddly shaped, purely 
technical unit for which no data were available or asked for At least 
in the larger cities, the new enumeration district will be identical with 
a block (that is, buildings surrounded by streets, with somewhere 
between 50 and 500 people), which at the same time is the lowest level 
of aggregation 

2 There is a general tendency away from publishing everything m 
books and tables but rather toward keeping data on cards or tape at the 
disposal of those who want them This will become crucial mviewofthc 


*Thc Qutllcnnach%eis op eit refers to published materials only and does not 
contain any information on card or tape 
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development The planning boards will be staffed more and more by 
people with a background m social science First attempts are being 
made to use the computer and the data repositories fortasks beyond the 
mere automation of administrative procedures 
This IS not general practice yet The bureaucratic structure, the usual 
time lag between big cities and the smaller ones, and the lack of people 
trained in the use of data, methods, and machines will slow down the 
rate of progress possible with the computer But it is certain that the 
situation in the 1970’s will look quite different 

PERSPECTIVES 

These trends should be taken into account when evaluating the pros- 
pects for systematic collection of “ecological" data The DATUM 
approach, in the long run, aims at minimizing the costs and maximizing 
the benefits for public as well as research institutions It starts from the 
assumption that it will be wiser, more practical, and more economical 
to support the growth and generation of existing data repositones than 
to establish some kind of “super archive ” 

The arguments in favor of such a decentralized solution may be briefly 
summarized 

1 The smaller the unit of analysis, the greater is its uiilil} for urban 
research Statistical studies today deal with data aggregated at the level 
of the community or — within larger cities — thedistnct Little can be 
said in favor of these umls, which are large, oddly shaped, heterogenous 
in structure, and incongruent with natural areas, however defined If 
data both on the physical and the social structure were to be assembled 
for a much smaller umt such as the block, these would be minor prob- 
lems It would then be possible to “construct” anyarea by accumulating 
data from several blocks until they show the size and shape desired, for 
example, of a specific market area, school district, polling district, or 
any other administrative unit 

2. The sheer number of these units makes centra! storage and retrieial 
unfeasible At the present practice, data are available for about 25,000 
local units (communalities or distncis) Although bulky, such an array 
of information could be handled at one place, if necessary If a block 
system was introduced, information for another 200,000 units would be 
gathered and asked for It is, of course, technically possible to store the 
data from one census for all the blocks at one place, that is, the national 
Bureau of the Census, and keep them there forever For all practical 
purposes, however, the data would not be accessible 
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population The nature of “proccss-produccd” data makes the dis- 
tinction even more ambiguous If archives arc to become instruments of 
research, besides their storehouse function, any limitation as to the 
nature of the data will limit the potential use of the archive 
7 The main (hsiinction is between tlaia hanks defined b) area and those 
defined by topic To pul it crudely, one type of archive would comprise 
all information that could be located within a given area, without much 
regard for the kind of information, the other type would limit itself to 
the data relevant for a specific problem, the location of the data being 
one variable among others In our context, such a “special-purpose data 
collection" could be concerned with topics such as occupational and 
residential mobility, traffic habits, urban renewal, or consumer behavior 
It could be restricted in area as well, but its full potential lies in (he fact 
that it would contain information from regions with difTcrcnl structure 
8 Local data archnes and special-purpose collections depend on each 
other The mam consumers of a data archive defined by area will be 
local planners and decision makers as well as regional scientists in- 
terested in this particular area The mam consumers of the special- 
purpose collection will be regional or social scientists not satisfied 
with data taken out of one regional context In many eases, the local 
archive can supply information on a specific problem that will then be 
included in the special purpose collection, whereas the latter facilitates 
comparative research, produces findings of more general relevance, and 
supports the development of indices typologies, computer programs, 
etc , which in turn can be used m the local data bank 


PREREQUISITES 

Although the trends point m this direction, such a network of local 
archives communicating with one another and with central holdings will 
not come about and function automatically To be sure, the whole 
scheme does not depend on the complete and simultaneous creation of 
local archives all over the country It could function as a step-by-step 
operation, largely parallel to the introduction of electronic data pro- 
cessing into public administration 

Several other conditions, however, have to be fulfilled before such a 
configuration can work satisfactorily 

9 The local data archives should be basically organized in a similar 
fashion A certain degree of similarity is secured by the standardized 
practices of census taking, by the rather uniform procedures in German 
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on four levels at the same time These four activities are reflected in the 
organizational structure, which is described elsewhere ’ In the long run. 
DATUM IS expected to provide (I) data, (2) information about data, 
(3) instruments of analysis, and (4) training 

Data Collections 

For reasons elaborated in detail in this paper, little cfTort has been 
made to collect and store large masses of data The mam emphasis will 
be on data that arc relevant to a specific problem but stem from dif- 
ferent sources and are not normally accessible at the same lime The 
collections will thus be topical m the sense of Point 7 and will be part of 
ongoing research projects The first DATUM projects have to do w'lth 
questions of occupational and residential mobility and with social 
structure and social change m urban renewal areas * 

There are aggregated data that arc of obvious relevance to these 
topics, such as census data on the structure of the labor or housing 
market There are surveys especially designed to cover the problem 
But beyond that, there are numerous other pieces of information that 
have to be located, pulled together, and related to other pieces of in- 
formation Many sample surveys, for instance, contain questions on 
first and present job, on location of home and firm, on recent moves 
Instead of rescuing and storing complete studies, with the technical 
problems of transformation, formats, etc , DATUM will extract only 
the necessary information, make it as compatible as possible with other 
data, and bring it into a computer-manipulable form These “special- 
purpose data collections,” rather than the original data can then be 
used elTectivcly by the research community 


Documentation 

As the purely archival functions of DATUM are played down some- 
what, It becomes even more important to know where to find what and 
how (cf Point 9) Data relevant for regional planning and research 
have to be documented centrally before any efficient communication 
between existing data collections can be realized The coverage here 
should be as broad as possible and extend far beyond the research 


*‘Daiaon DATUM” paper presented at ihe Third ISSC Conference on Data 
Archives London April 1966 

rvt-rSi? von DATUM ” Doc No 914 001 (Bad Godesberg 

Doc No 
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data banks There exist well-cstabhshcd routines for the automation 
of certain administrative activities,’® there is the widespread, if stiU 
somewhat abstract, belief that the goal is an “integrated” dala-process- 
ing system, but there arc only rudimentary notions of how long-range 
planning and policy decisions can be based upon an urban information 
system how to find the relevant data, how to make them compatible 
and accessible, and how to manipulate them ‘ ‘ 

Training 

Public administration is a rather conservative and inflexible institu- 
tion Acceptance of a new technology and a new style of problem solving 
will be slow , but once it is accepted in principle, changes m organization 
and philosophy will be massive and thorough It is extremely difflcult, 
however, to adjust the personnel to these changes There is a definite 
lack of people familiar with research methods, data handling, or com 
puter techniques Academic facilities hardly provide this kind of train- 
ing What IS needed is a wide range of programs designed for difierenl 
groups of people 

In Its training program, DATUM is going to concentrate on 
mainly two levels, there are other institutions that take care of other 
interests In a series of one day conferences, leading planners and 
administrators will become acquainted with the general pnnciples and 
problems of electronic data processing and its application m research 
and planning The mam emphasis will be on two week seminars for 
younger people from planning departments census offices, and research 
institutions In these small workshops, the participants will be exposed 
to a case study, taken from one of the research projects, which gives them 
a chance to become familiar with the potential of an urban information 
system 


For a num^ of yean ihe Kommunale Gemeinschaftsstelle fur Verwaltungs- 
proj^ls Cologne has carefully siudied thes- problems and made realistic 

‘ • The expectations of urban planning and urban research vivA vis local clecironic 
Mta processing machinery have been discussed and formulated by a DATUM 
^nrercTO in August 1965 see Oaten and Datemyzteme fur Stadtplanung und 
Regwmlfonchimg Doc No 903/01 6 (Bad Godesberg DATUM January 1966) 

nA-rT-f'.? Doc No 907/003 (Bad Godesberg 

DATUM November 1965) 

'*The Verem lur Kommunalwissenschaften Berlin plans a senes of seminars 
for top management m public administration. 
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social ecological research ^ As carlyas the second half of the eighteenth 
century, government statisticians had assembled extensive data on the 

demographic, economic,and religious structureof communities through- 
out much of Austria Hungary, this tradition of social, economic, and 
cultural cartography (or ‘ social geography ) has strongly influenced 
Austnan methods of data collection and handling down to the present 
day ^ 

Partly because of the development of Austrian “sociology” as meta- 
physical social criticism partly because of traditional resistance by 
Austnan academic circles to the establishment of social science research 
centers v-ithin the universities, and partly, of course, because of the 
institutional advantages in data collection originally enjoyed by the 
temtonal state, innovations in techniques of empirical ecological re 
search t>*pically emanated from ofTicial government statistiaans Thus 
the first data based efforts to move analysis from vanations at the level 
of proximal temtonal units to variations at the level of individuals 
were made by government electoral statisticians m the late nineteenth 
and early twentieth centuries Their technique consisted of drawing 
inferences about individual behavior from their observations of con- 
trasting sets of juxtaposed maps and from simple statistical manipula 
tions of aggregate data By the 1920 s the central statistical office had 
advanced to the analysis of primary data on some types of mass political 
behavior, the data were collected by means of colored ballots differen- 
tiating voters by sex 

Finally, m the early 1930 s, Paul Lazarsfeld and his associates 
launched their pioneering studies on the social relations and attitudes 
of youth and unemployed workers in Maricnthal, using a combination 
of primary data on personal attributes and behavior, derived contextual 


For rc%io*$ of Auiirian traditions in social science sccL Rosenmayr Austnan 
^lology — Pasi and Prcseni Notes on Some Histoncal Trends and Present 
Acmui« Soelai Science Information ^ (l%S) pp 85-101 L Rosenmayr (with 
r* ^lohgy in Austria flutory Present Activities and Projects (Graz 

Koein 196<)> F O Heyi and L A Vaskostes “Soziologic m Oeslerreich ** m G 
Bsermann (^ ) Die gegennartige Lage der So lologie Bettrage zur So'iologie 2 
(Siuttpn I9«) and F O Hcyt and L. A Vaskovics Sozialforschung und 
^zioJogi- m Onlcrrach Siand und Ejilwicklung seit 1945 ** Koetner Zeltschrifi fur 
So lohgleand So^lalpi}cholog,e /» I <I966) pp 94-JI6 On recent desclopments 
^ Bodnia (ed ) & lologie und So iologlestud urn (Vienna Spnngcr 

Verlag, 1966) pp 110-119 

, ^ ^gejmann ~Dc lopographisch-slatistischen Werke Oesterreichs 

Ulttcif ur-gcn derCfographiichen Cesellsehafi In \\,en 69(1927) pp 273 282 and 
U Arnberg-r Grundlag-n und Methoden zur kanographischen DarstcIIung dcr 
^ WkenngsimtwKkluns der krtnen hundert Jahre in Oesterrcich " Mliteifungen der 

Oesterreichiichen Ceographischen Cesellsehafi 702(1960) pp 271-313 
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On the other hand, the mere availability of a sufficient range of usable 
raw data — not only on individual attributes and behavior but also for 
terntonal units from the level of the commune {Gememde) and admin 
istrative county (poittischer BezirK) to the aggregate national level 
ought to encourage rapid innovation in research as well as in methods 
of analysis 


AVAILABILITY AND TYPES OF DATA 

Several types of organizations serving highly diversified commercial 
official, and academic clienteles are presently engaged m the generation 
of data relevant to ecological analysis Among the most important, 
although not necessanly alone in their fields, are the following 

1 The Austnan Central Statistical Office This produces a wide 
range of global and aggregate data for different terntonal units * 

2 The Co-ordinating Center of the nine Provincial Statistical Offices 
This operates within the Economic Section of the Lower Austrian 
Provincial Government It collates financial, demographic, social, 
cultural, and political data (both global and aggregate) for communes, 
counties, election districts, and provinces, and then it republishes the 
data in non machine readable form ® 

3 Commercial research institutes, such as the Vienna-based institutes 
for regional (both rural and urban) research and planning and for basic 
economic research These institutes maintain large staffs of experts in 
statistical geography, social ecology, and related disciplines Each owns 
extensive data files, partly m punch-card form, consisting of both 
primary (global attnbute) and aggregate data for terntonal units 

4 The statistical sections of the major economic interest groups and 
principal semipubhc occupational chambers (/Tammern) They compile 
data on individual party and group affiliations at different levels of 
society and polity “ 


* OcilcTTcichivrhes Siaiisti$chei Zcnlralamt 

‘ Vertnndungssiclle dcr Ontcrreichischen Bundeshnder. Amt der Nicderocstcr 
r«jchi«*en Landctregicning, 

Mmtitut fur Raumforschung und Raumplanung. Vienna and Institut fur 
W jrtichaftirorschung Vienna 

permit a comprehenme luting of these bodies Suffice it to say 
that ma;or private- economic interest groups m labor business, and agriculture 
as well as their principal semrpoblic chamber affiliates are critically important 
cata-c-nmten They are probably more important than cquisafent organiraliof” 
in nwt other Eori^n societies because of the pervasive intcrloclcmg organization 
oi Ausiran socal political and economic life These organizations frequently 



rCOLOCSCAt DMA O'! AVSTRH 

5 . Ccn,m«ci.il .unry rt-cnrth orpnitfiow. Uttc ate three of 
these at the national Ic'el. all biseJ ' Researeh 

0. Acadenne thatch ^taf^LL soetolopca, 

S'X'oaH." "he ranni; and yoath. unag santple sunep 
siithin small terlilorial units.'” .j,,rt,nn 

Uefltstthteeorthesesetsnr^ ; the lait three 

cf pnmar>- or t3cri>«J apprep’c ata of statistical mfor- 

are 'oneetned chielly S aSu.» o, ^hanor of 

malion or iur>c)‘ rc<ea^cfi on I 

lndi*iJualx. , , jhts fund of data: 

Seseral shortenmlnfs .empet the utility 

1 . While the deselopment of ..self, since coiapata- 

ereasinply sorhisticated oser time ,he noaptoSl >a“!' 

hilny *ith eatlicr ‘""‘‘lllrf Seoal Science Studies O 

vniotfsupported niid ^''■'^'^’''’**“^j^^^„m,rt!vharh“atdl)'ii' 

.sd.ir.cuU). lurN^iS base This is 

of both ptogf^iphic areas undertaVen or 

for commercial surse) research political p 

to the pragmatic momentar)- n 
ihcir sponsoring interest asoscia i 


pertisp* the 

posies, mom 

• Austrian Gatlup Institute «« S3 

In Bddmon to these three Mn.v«s..y 

alTiluted ssith parties ®r *”*5’^, * fAj Souelo?"' ^''of data 

Soct« Sludics Group t*’-^^.'"S„fSeWe*n' ’'^'’‘^iSe^miUr t>P« s 

Soz.alys.ssenschaftliche [^^""Suu.es ihal .fic research 

of Vienna. Other 8caJcni<c 

(conles.uat data on ''’‘^‘'’‘*^,y;jyBndStali«'«<^"'J,j/ politics »n^ Advanced 

afe‘.tnsiiluic(orrjnpiricatS« InsiittHf f®r ^8^ Insmu'® t^or . 

Untiliut fur AgrarptdittK ,«,»/&»• *<i*ere 4„flthel966 

Stud.« La Seientific Research (/««»" •» ,nc!ud"'8 


jfW). Vienna. .ci<Kiof pottsC^**?^,ffniy yea« 

‘ A compilation and di^ „,srVed the end of t jjy god . j^mJis I 

tamenl/ry eleelion. .l*h . rmiw^' gjp,., i ..il «P” 

It m posmar Austria. 

cnoMonJihe PoUuccl Order In ArP 



572 


RODNEY STIEFDOLD 


2 Another problem that se\erely compromises data usability is the 
inadequacy of many of the oflicial Austrian census categories For 
example, the census classification Handel wul Verkehr includes trans- 
portation workers as well as merchants and bankers, while the term 
“salaried employees’ refers simultaneously to salesgirls and managers 
of large industrial enterprises 

3 In addition to problems caused by the heterogeneity of socio- 
economic groupings in the occupational statistics, another perplexing 
feature of official aggregate data is the failure of the Central Statistical 
Office to include in the publications on electoral statistics Gemetnde 
(commune) and other boundary changes that occur from one national 
parliamentary election to the next These data can be found, but the 
process is often time consuming and expensive Nor are the occasional 
publications on boundary changes cross referenced in electoral publica- 
tions Finally, these publications often are unavailable for a decade or 
more following issuance of official rating statistics, which considerably 
compromise the accuracy of many of the existing electoral studies of 
twentieth century Austria ‘ * 

Nonetheless, despite difierent levels of sophistication in sampling 
and gathering, despite frequent failure of publications to specify prob- 
lems encountered m delimiting precisely the territorial or aggregate 
individual sampling umts, despite difierent levels of “hardness” of data, 
let alone ‘cleanliness’ of available machine formulated data— “many 
of which problems persist elsewhere — the bulk of ecologically relevant 
data IS probably comparable in usefulness to that produced in most 
other Western European countries 


A preliminary list and discussion appear in Stiefbotd Polilische Mcinungsforschung 
1954-1965 m Peter Gerlich Georg Ress and Rodney Stiefbold 
Ti? (Vienna Yerlag fur Jugend und Volk 1968) 

e Austria electoral system has had an interesting effect on the generation of 
survey research studies, the number of deputies allotted to specific election districts 
number of citizens residing there (including minors), 
wif^ ^ premium on a good showing at election time in districts with 

[^^,11 ® number of (highly classified) survey studies of 

western Austria This subject is discussed 
An^r,U Elections and the Pohuca! Order op at Chapters 3 5 The 

wlm described m Uwc Kilzmger The Austrian Electoral 

System Parhamemary Affairs 12 (1959) pp 392-4(W 

1945-1961 IS the Oesterreichische Statis 
tisches Zentralamt Gebiets und Namensanderungen in der Ven^allungseinteilung 
CarlUeberreuterVerlag 1962) Italsoconta.nsb.bl.ograph.es 
of sources on boundary changes m the Austnan provinces s.nce 1869 
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mistrust stjli casts long shadows over social “science” research, the 
problem of access can scarcely be overemphasized. 

The inauguration in 1961 of the Institute for Advanced Studies and 
Scientific Research (IHS/Vicnna), however, marked a positive step 
toward removing this barrier to social science research. The Institute, 
an independent academic center for advanced training and research 
in the social sciences, is supported by grants from the F ord F oundalion, 
the Austrian Federal Government, and the Vienna municipal 
government. It was established, in part, to redress the existing research 
imbalances and supplement the teaching and use of social science 
methods in Austrian higher education. 

The Institute maintains a complete array of unit record equipment 
for the processing and analysis of punch cards, and has recently com- 
pleted installation of an IBM 1620 computer. Plans arc being developed 
to acquire, process, store, and maintain machine-readable data and 
supporting documentation from a number of organizationally distinct 
Austrian data sources. IHS/Vienna plans to archive data from Austrian 
demographic, fiscal, agricultural, cultural, religious, and educational enu- 
merations. It is also negotiating with relevant commercial organizations 


dcr Lupe," Pohusche Perspfkmen. Nowmber !965. Indeed, the quesuon of whether 
Of not it 15 oen Ugiilmaie to carry out such studies has been a subject of heated 
debate In ihe Socialist Party until very recently. Among many articles on this subject 
appearing in the Socialist bimonthly Die Zukun/i, see especially that by WiHi 
Lmanec, “Memungsforschung und politischc Arbeit.” 23 (December 1965), PP 
These taboos are rapidly disappeanng, howescr. There has been a marked gro»lh 
of serious public interest in politics, as well as an increase in the gathering and 
dissemination of information about politics; furthermore, incumbent political and 
social elites appear to be becoming more tolerant of the spread of both information 
and cTitiasm Several factors have abellcd these developments 
t. The gain in political importance during the late coalition era of the nonparty 
press 

2. Competition among several publishing houses (such as Moldcn, Herder, and 
Euro^) for a newly discovered commercial “political interest” market 
3 The writings of a number of bright, well-informed, and incisive political 
journalists, respected as “politicaJ scientists.” who offer constructive cnticisms of 
contemporary politics 


4. The emergence of internal “technocratic" oppositions m each of the major 
political parties 

5 Penistmt, determined, and eventually highly effective reform mongenng by 
leading publiasts in the democratic center of the political spectrum 
b 1 he enhanced political relevance and importance of old — and the esiablish- 
mCTt^ new — nonparty, independent research institutes 
7. The creation of the new College of the Social and Economic Sciences in Linz 
and the concomitant passage of new federal legislation designed to modernize social 
saence cumculurm m Austrian institutions of higher learning 
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3 Type of data base utilized Arc they interested m global and aggre- 
gate data for territorial units, survey and derived contextual data for 
individuals, or both simultaneously’’ 

4 Actual methods of analysis employed m processing data at one level 
or the other 

In terms of the first three criteria, several partly overlapping but 
nonetheless distinct approaches can be distinguished We might desig- 
nate these dilTerent approaches as follosvs 

1 Religious sociography, an approach identified with the Institut fur 
kirchhche Sozialforschung (IKS/Vienna) and its former director, Erich 
Bodzenta This approach utilizes territorial aggregate data (or the 
original raw individual data in those cases where local authorities have 
preserved them and will permit their use for scientific research purposes) 
m an attempt to describe and explain variations in individual behavior, 
particularly with respect to individual participation m church-related 
activities 

2 Social ecology, an approach identified with the Social Science 
Research Center and its director, Leopold Rosenmayr, with the work 
of Rosenmayr’s student, Richard Gisser, with the Institute for Rural 
Sociology and its director, Ernst Lagler, with much of the work of 
Erich Bodzenta, and, finally, with some of the more recent studies 
sponsored by IKS/Vienna and directed by LaszJo Vaskovics This ap- 
proach IS characterized by the combination of primary data on individual 
variations (from the raw, nonaggregated census data interviews, or 
sample surveys), with detailed ecological analyses of the proximal 
contexts of such variations It not only seeks explanations for current 
vanalions but also attempts to isolate sources and dimensions of change 

3 Political ecology, an approach identified with the studies of Herbert 
Tingsten Robert Danneberg, and the Federal Statistical Office in the 
First Austnan Republic, and, among others with Walter Simon Otto 
Lackmger, and the Vienna Municipal Statistical Office in the Second 
Republic Based on official aggregate census data voting statistics, and 
local electoral registers, this approach involves the analysis of electoral 
turnout by categories of citizens differentiated by sex, age, and/or 
occupation, and the study of official tabulations of party votes by sex. 


Internationales katholisches Institut Oir kirchliche Sozialforschung, Abteilung 
^ 1962, Insmut fur kirchliche Sozialforschung. 

Methodological questions are discussed in Erich Bodzenta el at Kirchliche 
^malforschung un-1 Seelsorge BenchteausOesterreich Der Seetsorger, 2(5'(I956) 
t Bodzenta, “Forschungen m Oesterreich,” Social Compass, 6 (1959) 
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have occasionally worked with combinations of individual and terri- 
torial data for this purpose 

This sketch identifies the principal ecological research traditions m 
contemporary Austrian social science in terms of level of analysis, 
purposes, and data bases In terms of methods used in processing 
available data, one can identify the following as the most typical and 
frequently employed techniques 

1 The construction of simple typologies of available territorial units 
and the straightforward contrasting of resultant distributions of aggre- 
gated individual behaviors 

2 The construction of percentage indexes of selected behavioral 
phenomena and the straightforward contrasting of their tcmtonal 
distributions 

3 The comparison of paired maps contrasting temtonal attributes 
and aggregated behaviors, permitting visual cartographic correlations 

In most cases more sophisticated statistical techniques, such as partial 
or multiple correlation, or regression analysis, have not yet been used 
m handling quantified ecological data Moreover, most of the ap- 
proaches cited previously are desenptne They are concerned pnmanly 
with describing concrete vanations between terntorial units or agglom 
erations of individual behaviors which they regard as essentially unique 
But an important core group — those identified earlier as soaal, 
political, or urban ecologists — traditionally has been comparatnely 
and/or de\eIopmentaIly onenied In some cases these analysts have been 
concerned with the development of genera! propositions about the 
consequences of vanations m local community structures or m patterns 
of behavior of particular groups of individuals, m other cases they have 
attempted to construct general hypotheses about the sources and dimen- 
sions of change, and indeed they have developed a number of highly 


*' Thu approach to ecologrcal analysis has a heavy planning bias, that is, it is 
identified with offkia! goal-onented cfTorts on the part of go\-emnicntal anej admm 
istrative authorities to discover past trends and prevailing patterns of tcmtonal 
economic and social life m ord-r more effectively to antiapate and direct change 
Esen when using individual data regional and urban ecologists do so pnmanJy m 
order to charactenze better the areafs) under study 
Finally, we should note that some of the publications of the pnvate economic 
interest groups and their semipublic chambers as well as the Manaisbenchte zni 
Beilagen of the Institute for Economic Research, belong m this category of ap- 
proaches to ecological research in present-day Austria, despite the fact that they 
usually base their analyses on data aggregated for large units — the provinces or 
the national state 
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sophisticated models of various part processes of social change and 
modernization 

No doubt, the considerable expertise achieved b> past and present 
Austrian social scientists m the manipulation of masses of ecological 
data by "traditional methods has helped pave the way for a relati\cl> 
rapid and smooth transition to the age of data processing by electronic 
computers But what is urgently needed now is that high degree of 
bipartisan political CO operation which is absolutely necessary if younger 
Austrian social scientists are to have available esen the mitumum 
resources and independence requisite for that transition 
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The Future Development of 
Social Science Data Archives in 
Latin America 

The prescw report lists the principal obstacles to the development of 
modern social science data archives in Latin America, discusses the 
individual and institutional resources that may be used to improve 
existing structures and to create new ones, and gives a set of general 
principles to guide the work of the Standing Committee on Social 
Science Data Archives of the ISSCin thearea A general plan ofaction 
IS presented, together with a senes of more specdic steps 

PRINCIPAL OBSTACLES TO THE DEVELOPMENT OF SOCIAL 
SCIENCE DATA ARCHIVES IN LATIN AMERICA 

Each continental area presents a different set of problems and pros 
pccts m the field of social science data archives fn the United States 
the main challenge is the development of technical and intennstitutiona! 
systems to mzx\mize the avaiJabihly and uses of existing large data 
bases, many of which are already located in research centers The chief 
obstacles seem to be technical,* financial, and those derived from 
institutional jealousies and lack of co-operation 
I do not have a detailed knowledge of the siluation m Europe, but I 
surmise that the problems there are a muture of the same ingredients. 


• Ceneer for Social Reseanrh. Torruau Di Telia InHitute. BuenM Aiiw, 
Argenima. 

* Including human rcludance lo change iccaiufliies. 
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perhaps in differenl absolute and relative amounts, plus the resistances 
derived from national jealousies and lack of co operation 
The development of social science data archives in Latin America 
faces all these obstacles, plus others which, although not totally absent 
in the United States or in Europe, are here much more critical A partial 
list IS the following 

1 Financial problems 

2 Institutional isolation and misunderstanding 

3 National isolation and mistrust 

4 Lack of appropriate computing facilities and even of unit record 
equipment 

5 Scarcity of v.ell trained data archive technicians, including pro- 
grammers and other computer personnel 

6 Poor communication facilities This factor has to do with the 
technological and administrative substructure and is different from 
the institutional and national isolation and mistrust listed already 
(although it often reinforces both) Included here arc high relative 
mailing costs, low quality of mail and telephone systems, high relati>e 
costs of air and surface travel and transportation, low quality and high 
relative cost of the available secretarial help, etc 

7 Higher relati\e costs and, hence, higher subjective value of the 
data for their producers This factor makes for a highly possessive 
attitude on the part of many local social scientists (an attitude not 
totally absent m other continental areas) 

8 Low level of training and “traditional” attitudes on the part of 
some Latin-American scientists and most local administrators — pro- 
ducers of data and/or decision makers as to Ihcir availability to the 
scientific community at large 

9 Institutional instability m social sacnce research and training, as 
well as political instability affecimg the governmental agencies that 
produce and keep (or dispose of) many important sets of data 

10 The effects of extracontinental interferences* and of the exploit- 
ative attitudes of some non Latin American social researchers, which 
reinforce the xenophobia and mistrust of the local ones 

This IS not an unduly pessimistic view of things but rather a list of 
factors that have to be taken into consideration, as unavoidable facts 
of life, in any plans for the development of social science data archives 
m Latm Amenca 

* As in the case of Project Cameiot and similar ones see I L Horowitz (ed ) 
The Rise and Fall of Project Cameiot (Cambndge, Mass The MIT Press, 1967) 
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WDIVIDVAL Am INSTITUTIONAL RESOURCES 

On the other hand, some basic aspects of the situation permit us to 
plan from a more posm^e point of view There i$ m Latin America a 
growing body of social scientists trarned m modern techniques Com 
munications among them, although still unsatisfactory, arc steadily 
increasing Even though the institutional mortality is high, there is at 
present an increasing, albeit vaiymg and shifting, number of modem 
and efficient research centers that may be used as the core of our effort 
to establish, develop, and upgrade soaal science data archives through- 
out the area 

Moreover, the rate of change and improvement m the field of social 
science in many areas of Latin America is often quite high, and situations 
that at first ghnee seem hopeless have a surpnsing capacity to change 
rapidly for the better A good instance is the emergence of first-rate 
empirical research in several social sciences, such as economics and of 
modern institutions of research and training in a number of countries 
v>here nothing of the Xmd existed ten to twelve >ears ago 
In BogoiS, Colombia, m October 1967, more than forty mslilutes of 
research m the social sciences have constituted the Consejo Latmoamm 
cano de Ciencm Soaales (Latm-Aroerican Social Science Council) 


SOME GENERAL PRINCIPLES FOR THE DEVELOPMENT 
OF DATA ARCHIVES 

Our efforts m this area should be channeled, as much as possible, 
through Latin-American institutions Whenever possible these Latm- 
American centers should work vvilhm international or intcrinstitutional 
frameworks The Ulin-American Social Science Council is a good 
example of the desirable t>pe of (he multination. multi institution 
interdisciplinary setting , 

We must always take into consideration the local pecuhariiies and 
differences, acting not at the ’‘Latin-Amencan’ level which m this 
area is nonexislent, bar at the It'd of ‘h' th'MantW. anil 
the discipline and the inslilulion Pcisonal contacts arc in^spensaWe 
especially m the initial slajcs of out wort Letters and printed "talcnah 
should be used only as a complement to those person to ^ 
view, and never as their substirute We must 
m our effort with the necessity for a miwnum of lact. diplomacy, 
consideration for local idiosyncrasies sfiiiudes 

A strenuous effort should be made to avoid ^ ' f "f 

and negative competition among non-Ulin American msinutions On 
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the other hand, our action in promoting social science data archives 
must be explicitly based on principles of open and symmetrical co- 
operation, redressing as much as possible the effects of technical and 
financial disparities between local and “outside* ** institutions and pro- 
viding liberal access to data in every archive to all legitimate research 
centers 

As 1 said before, data are even more precious to Latin-Amencan 
researchers than they arc to Americans or Europeans Many Latin 
Americans feel that they should get “something** when parting with a 
copy of their files or cards Sometimes this will be the only way for our 
archives to obtain a significant body of data Moreover, such require- 
ments may be justified as a means to support further research, the pro- 
duction of more data, etc A list of some obvious items for such exchange 
should include funds for research, books, other data, fellowships 
(especially to tram social science data archive technicians), and tech- 
nical advice and assistance I shall come back later to some of these 
and related items 

At least in the initial stages of our progress, we should allow Latin- 
Amencan data producers’ to have exclusive rights to the use of their 
materials for longer periods than we would ordinarily deem reasonable 
in the United Slates or Europe In many centers of Latin America 
there is a “possessive” tradition that we cannot just wish away, on the 
other hand, it is objectively true that local researchers, with far fewer 
financial, technical, and human resources and often with less training, 
need more time for their analyses 

A GENERAL PLAN OF ACTION 

On the basis of these general principles, the mam objectives of a 
realistic program will be, m an approximately sequential order 

1 To diagnose in a specific and detailed fashion the situation within 
each region, country, and discipline 

2 To protect from destruction as many as possible of the existing 
sets of data 

3 To improve the existing archival facilities and to create new 
national and/or regional archives 

4 To develop new data sources and to incorporate new sets of data 
into the existing and new archives 

* Sur>ey directors social scientists and researchers who have gathered some type 

of statistical or historical matenal etc 
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Our most Dressing oter-all need is information about existin| situa- 
the access to them and decide on nc available 

■" "" 

specific situation i^, stable than their 

Research institutions in Latin ,hat 

counterparts in Europe or the ,s that the active 

the institutions disappear, in become unsuitable 

scholars and administrators lea« persecution, and 

as a research "^“ .“r .„ the same or another country 
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The first institution is downgrad^. ano 

the data gathered and left tehin ,„dividoal scientist leaves 

The same risk IS often present »hCTever .|o„e wolf,' the only 

an institution Quite often he has ,pp,rical research of a given 
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Whenever our „siy descnbed, we ^ p he fact that 
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aremaurebh.erlntAl.n-™’ 

I. these expenses an: 

• Like mosl Eoreeie 

than in IheUmteil Stares o 



586 


JORGE OARCIA-BOLTZA 


one example of the type of support that »e can offer for acquiring a set 
of data and presenting its destruction The only features that recom- 
mend It are (a) that by offering to pay the cost of punching the cards, 
»e put our hands on them at the earliest possible moment and avoid any 
subsequent vicissitudes they might suffer and (h) that the costs involved 
are minimal 

DEVELOPMENT OF EXISTING AND NEW ARCHIVES 

Perhaps the hardest decision for us, to be made on the basis of our 
diagnostic report, is whether to wait until a wide enough consensus is 
obtained in a given country or region or to choose one of a few promis- 
ing centers m each area and to concentrate in it our first efforts, trusting 
in some kind of “demonstration effect” to spread modem standards 
and techniques to the rest of the research and administrative agencies 
I tend to favor the second alternative This will mean the creation of 
“show-case” data archives in the most modern research institutions of 
Latin Amenca We shall then be able to organize seminars for top 
administrators from other centers within the region (or perhaps at a 
Latin-Amencan level) so that they will have first-hand information 
about the advantages of the new ^sterns, the support available for the 
upgrading of archival facilities provided minimal standards are met, etc 

For some time to come, the economic and technical problems of 
Latin-American archives will be those of rather small archives of 
heterogeneous material, including not only survey and ecological data 
but also historical data, etc We could use any experience our European 
and American colleagues have had with similar problems 

A substantial part of the assessment of each regional or national area 
— which starts with what I have called the diagnostic report and develops 
into a continued effort to update information about the data situation in 
each discipline — must be devoted to locate potential, unused new 
sources of data This will be coupled with an active program to gam 
access, gather, preproccss, and incorporate those new sets of data into 
existing or new archives 

Some of these programs may be included in ordinary research projects 
cither as one of their mam objectives or as an important by product 
In other cases, a speaal program will be needed, one that will not result 
immediately in analytical studies but will be devoted exclusively to 
gathering, cleaning, reclassifying, standardizing, and duplicating the 
new data Since the indispensable financial and technical resources are 
cspeaally scarce m the area, we shall have to support local institutions 
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or intcnnstitutional consortia Jo realize these tasks and create at a 
higher level — perhaps as a part of the Task Force on Social Science 
Data Archives of the Latin Atncncan Social Science Council — a com 
mittee to assign priorities among different potential data sets 
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469 473 483 

Party loyalty 463 467 476 
party sysK/n ,,, 

,1) Canada, 214 
multi-. 487 502 

regional 201 

(WO 488 497 354 

Periphery versos center 15 
36*,,, s28 231 

Pluralism *54 - . . 499 


Q technique 316 

Regression 333 

curviime*/ 3^ 

TO’ !"5 

E""”".." Wl"' “irSf 285- 
Rclgion 161 l87 2'5 " 

290 295 495 

R'P^Sr 24S4’ 230 J43 

4« 502 5W 5“j4, 549 
biiity 



606 
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Right iconunued) 
tn Ilsly, 236, 244-248, 252, 254, 
260-283 

Role social, 109 
Rotary Clubs, 158 

Rural area communities 95, 97, 103, 
158, 194. 196, 206, 212, 244, 
299. 351. 442 

Sample 

area-quota, 188 
composite, 536 
context-oriented, 123-124 
dispersed, 2J9 
oversampling 129 
national representative, 124 
of limited S12C, 29-30 
size of population in, 127 
Scheme, conceptual, 21 
Score (distribution) 
extreme, 375, 376 
mean, 221 

Secondary association. 159 
Segregation, 376 

Selectivity, homopolitical, 164, 185 
Simulation, see Analysis 
Size of locality, see Units of analysts 
Slum, 387. 388 

Social Credit Party (Canada), 190, 
204. 205, 213. 216 
Social Democratic Party 
in Italy, 240-242 
m Sweden, 309 
Socialist Parly, 7, 8 
m Austria, 574 
in France, 288-298, 473 
in Italy, 240-242, 250-283 
in Spam, 119 
Socialization pattern, 544 
Social scientists, communication 
among, in Latin Amenca, 

583 

Sociography, religious, 576 
Sociometry. 536 541 
Space-searching procedure, 165, 178 
Spatial structuring 157 
Standard deviation. 221, 374, 388. 
399. 545 

Status, socioeconomic, 162, 219, 221, 
228, 379, 386, 420, 425, 426 
444 

■see a}so Income 
Stereotype, national, 135 
Strikes 98. 239-252 


Structural clTcct, 53-68, 108, 119, 122, 

m. m, 126, 221 

Structure, interrelation between spa- 
tial and social, 556 
Subjective class identification, 119- 
120 

Suburb, suburbanism, 310, 343, 350— 
368, 389, 391 

Suburban-central city dimension, di- 
chotomy of, 344, 348, 350, 
355, 356. 360. 362. 363, 365, 
366 

see also Penphery versus center 
SojcJde. 9. JOS, 224 
Survey 

atomistic, 87 

contextual, 293 

cross regional, 89 

cross-sectional. 538 

ecological. 87. 88-89, 129, 537 

individualist. 88-89 

regional, 577 

within small unit, 571 

standard nation wide. 88-89, 233 

traditional. 130 

see also Data 

Tautology. 70, 74 
algebraic, 82 
in statement, 48 
statistical. 69 
Television, 238-283 
Themes, perennial, 30, 32-33 
Theory 

construction of, 32 
heunsUc, 25 
learning, 185 
specification of, 25 
substantive, 32 
vcnfication of, 22 
Thinking pragmatic, 25 
Thomism 32 
Time dimension, 534, 542 
Totalitarian society, 231 
Trade union membership, 236 
Transplantation theory, 360, 362, 

368 

Turnout, electoral, 158, 344, 357, 363, 
484 

Turnover of candidates 469 
Type ideal polar normal 514 
Typology viii, 6, 7, 234, S61 
of areal units, 10 
of dwelling areas 9, 577 
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TypoJo^ {continued) 
ccofogicnl, V. 299 
of ecological factors. 446, 447 
of fallacies, see rollacy 
of London distneta, 371-395 
of meiropolitan communities. 414, 
44 S 

of metropolitan ecotofifcal stnic- 
turc, 449 

of political constellation. 234 
of political domains, 54S 
of social structures. 127-129 

Ukranian minority (Canada). 

212 

L/niti of anal>sis 
heterogeneity of. 375. 376 
homogeneity of, 331, 382-3S3 
selection of. 332 
size of. 148-155, 330 
standard geographical. 575 
Urban community, 299 
Urban development 559 
Urban planning. 520 
Urban renewal. 562, 564 
Urban research, 559, 560 
Urban-rural comparison, 498, 509. 
513 


Ufbafl spatial structures, 302, 313, 
331 


rlhoocentric. 80 
social 362 
subcultural. 228 
Vanablc 
crucial 26 

psychological. 101, 102 
selection of, 191, 326, 328, 338 
Variance 

ccologfcal. 54 55, 61 
residual. 234 
Vanance component, 76 
Vanmas (solution in (actor analyos) 
entenon, 310 
Varimax dimension. 336 
Vanmas normal position, 352 
Vanmax prmaple 302. 309, 312, 313 
Vanmax rotation, 306, 419, 426, 504 
Vector mean, 384 
Violence, poliiical. 24. 26 
Voluntary organuaiion 160, 169, 218, 
231 

Working class. 288. 289 


Xenophobia, 582 



